Overview
Brought to you by YData
Dataset statistics
| Number of variables | 60 |
|---|---|
| Number of observations | 5019782 |
| Missing cells | 166704518 |
| Missing cells (%) | 55.3% |
| Total size in memory | 2.2 GiB |
| Average record size in memory | 480.0 B |
Variable types
| Text | 60 |
|---|
Dataset
| Description | Naturalis Biodiversity Center (NL) - Botany 0061690-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.4ze7ns |
license has constant value "CC0 1.0" | Constant |
rightsHolder has constant value "Naturalis Biodiversity Center" | Constant |
institutionID has constant value "https://ror.org/0566bfb96" | Constant |
collectionCode has constant value "Botany" | Constant |
basisOfRecord has constant value "PreservedSpecimen" | Constant |
samplingEffort has constant value "0.0 m" | Constant |
island has constant value "51.41942" | Constant |
countryCode has constant value "WGS84" | Constant |
maximumDistanceAboveSurfaceInMeters has constant value "Asia" | Constant |
geodeticDatum has constant value "WGS84" | Constant |
coordinateUncertaintyInMeters has constant value "South-Western" | Constant |
verbatimCoordinates has constant value "Siam [Thailand], Kwae Noi Basin Expedition, near Neeckey, near Wangka." | Constant |
verbatimSRS has constant value "150.0 m" | Constant |
geologicalContextID has constant value "15.1" | Constant |
earliestEonOrLowestEonothem has constant value "98.46667" | Constant |
latestEonOrHighestEonothem has constant value "WGS84" | Constant |
identificationVerificationStatus has constant value "Fungi-Ascomycota" | Constant |
identificationRemarks has constant value "Lichenes-Lecanoromycetes" | Constant |
namePublishedIn has constant value "species" | Constant |
subgenus has constant value "Fimbristylis bisumbellata (Forssk.) Bubani" | Constant |
vernacularName has constant value "Plantae" | Constant |
nomenclaturalCode has constant value "ICN" | Constant |
nomenclaturalStatus has constant value "Poales" | Constant |
otherCatalogNumbers has 3742862 (74.6%) missing values | Missing |
eventDate has 856201 (17.1%) missing values | Missing |
habitat has 4109448 (81.9%) missing values | Missing |
samplingEffort has 5019781 (> 99.9%) missing values | Missing |
continent has 902365 (18.0%) missing values | Missing |
island has 5019781 (> 99.9%) missing values | Missing |
countryCode has 5019780 (> 99.9%) missing values | Missing |
stateProvince has 3065004 (61.1%) missing values | Missing |
locality has 763737 (15.2%) missing values | Missing |
verbatimElevation has 3265629 (65.1%) missing values | Missing |
maximumDistanceAboveSurfaceInMeters has 5019781 (> 99.9%) missing values | Missing |
decimalLatitude has 2925885 (58.3%) missing values | Missing |
decimalLongitude has 2925885 (58.3%) missing values | Missing |
coordinateUncertaintyInMeters has 5019781 (> 99.9%) missing values | Missing |
verbatimCoordinates has 5019781 (> 99.9%) missing values | Missing |
verbatimSRS has 5019781 (> 99.9%) missing values | Missing |
geologicalContextID has 5019781 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 5019781 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 5019781 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 5019780 (> 99.9%) missing values | Missing |
bed has 5019780 (> 99.9%) missing values | Missing |
typeStatus has 4932431 (98.3%) missing values | Missing |
identifiedBy has 4152104 (82.7%) missing values | Missing |
dateIdentified has 4581006 (91.3%) missing values | Missing |
identificationReferences has 5019780 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 5019781 (> 99.9%) missing values | Missing |
identificationRemarks has 5019781 (> 99.9%) missing values | Missing |
taxonID has 5019780 (> 99.9%) missing values | Missing |
acceptedNameUsageID has 5019780 (> 99.9%) missing values | Missing |
namePublishedInID has 5019780 (> 99.9%) missing values | Missing |
parentNameUsage has 5019780 (> 99.9%) missing values | Missing |
namePublishedIn has 5019780 (> 99.9%) missing values | Missing |
phylum has 4742156 (94.5%) missing values | Missing |
class has 4741605 (94.5%) missing values | Missing |
order has 143842 (2.9%) missing values | Missing |
subgenus has 5019781 (> 99.9%) missing values | Missing |
specificEpithet has 420613 (8.4%) missing values | Missing |
infraspecificEpithet has 4607995 (91.8%) missing values | Missing |
scientificNameAuthorship has 355313 (7.1%) missing values | Missing |
vernacularName has 5019781 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 5019781 (> 99.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 15:41:55.321161 |
|---|---|
| Analysis finished | 2025-01-14 15:44:32.671775 |
| Duration | 2 minutes and 37.35 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 5019782 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 5019782 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2514633172 |
|---|---|
| 2nd row | 2980371442 |
| 3rd row | 2514602651 |
| 4th row | 2980366433 |
| 5th row | 2514610075 |
| Value | Count | Frequency (%) |
| 2514633172 | 1 | < 0.1% |
| 2980357438 | 1 | < 0.1% |
| 2516414075 | 1 | < 0.1% |
| 2980344448 | 1 | < 0.1% |
| 2516430099 | 1 | < 0.1% |
| 2980380439 | 1 | < 0.1% |
| 2516309267 | 1 | < 0.1% |
| 2980358429 | 1 | < 0.1% |
| 2514610078 | 1 | < 0.1% |
| 2516623054 | 1 | < 0.1% |
| Other values (5019772) | 5019772 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 8927787 | |
| 2 | 7962993 | |
| 1 | 7915696 | |
| 4 | 4128324 | |
| 3 | 4110468 | |
| 6 | 4020457 | |
| 7 | 3910237 | |
| 0 | 3130209 | 6.2% |
| 8 | 3047255 | 6.1% |
| 9 | 3044394 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50197820 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 8927787 | |
| 2 | 7962993 | |
| 1 | 7915696 | |
| 4 | 4128324 | |
| 3 | 4110468 | |
| 6 | 4020457 | |
| 7 | 3910237 | |
| 0 | 3130209 | 6.2% |
| 8 | 3047255 | 6.1% |
| 9 | 3044394 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50197820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 8927787 | |
| 2 | 7962993 | |
| 1 | 7915696 | |
| 4 | 4128324 | |
| 3 | 4110468 | |
| 6 | 4020457 | |
| 7 | 3910237 | |
| 0 | 3130209 | 6.2% |
| 8 | 3047255 | 6.1% |
| 9 | 3044394 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50197820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 8927787 | |
| 2 | 7962993 | |
| 1 | 7915696 | |
| 4 | 4128324 | |
| 3 | 4110468 | |
| 6 | 4020457 | |
| 7 | 3910237 | |
| 0 | 3130209 | 6.2% |
| 8 | 3047255 | 6.1% |
| 9 | 3044394 | 6.1% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0 1.0 |
|---|---|
| 2nd row | CC0 1.0 |
| 3rd row | CC0 1.0 |
| 4th row | CC0 1.0 |
| 5th row | CC0 1.0 |
| Value | Count | Frequency (%) |
| cc0 | 5019782 | |
| 1.0 | 5019782 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 10039564 | |
| 0 | 10039564 | |
| 5019782 | ||
| 1 | 5019782 | |
| . | 5019782 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15059346 | |
| Uppercase Letter | 10039564 | |
| Space Separator | 5019782 | 14.3% |
| Other Punctuation | 5019782 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10039564 | |
| 1 | 5019782 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 10039564 |
Space Separator
| Value | Count | Frequency (%) |
| 5019782 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5019782 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25098910 | |
| Latin | 10039564 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10039564 | |
| 5019782 | ||
| 1 | 5019782 | |
| . | 5019782 |
Latin
| Value | Count | Frequency (%) |
| C | 10039564 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35138474 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 10039564 | |
| 0 | 10039564 | |
| 5019782 | ||
| 1 | 5019782 | |
| . | 5019782 |
rightsHolder
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Naturalis Biodiversity Center |
|---|---|
| 2nd row | Naturalis Biodiversity Center |
| 3rd row | Naturalis Biodiversity Center |
| 4th row | Naturalis Biodiversity Center |
| 5th row | Naturalis Biodiversity Center |
| Value | Count | Frequency (%) |
| naturalis | 5019782 | |
| biodiversity | 5019782 | |
| center | 5019782 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 20079128 | |
| t | 15059346 | |
| r | 15059346 | |
| e | 15059346 | |
| 10039564 | 6.9% | |
| s | 10039564 | 6.9% |
| a | 10039564 | 6.9% |
| d | 5019782 | 3.4% |
| C | 5019782 | 3.4% |
| y | 5019782 | 3.4% |
| Other values (7) | 35138474 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 120474768 | |
| Uppercase Letter | 15059346 | 10.3% |
| Space Separator | 10039564 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 20079128 | |
| t | 15059346 | |
| r | 15059346 | |
| e | 15059346 | |
| s | 10039564 | |
| a | 10039564 | |
| d | 5019782 | 4.2% |
| y | 5019782 | 4.2% |
| v | 5019782 | 4.2% |
| o | 5019782 | 4.2% |
| Other values (3) | 15059346 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5019782 | |
| N | 5019782 | |
| B | 5019782 |
Space Separator
| Value | Count | Frequency (%) |
| 10039564 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 135534114 | |
| Common | 10039564 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 20079128 | |
| t | 15059346 | |
| r | 15059346 | |
| e | 15059346 | |
| s | 10039564 | 7.4% |
| a | 10039564 | 7.4% |
| d | 5019782 | 3.7% |
| C | 5019782 | 3.7% |
| y | 5019782 | 3.7% |
| v | 5019782 | 3.7% |
| Other values (6) | 30118692 |
Common
| Value | Count | Frequency (%) |
| 10039564 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 145573678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 20079128 | |
| t | 15059346 | |
| r | 15059346 | |
| e | 15059346 | |
| 10039564 | 6.9% | |
| s | 10039564 | 6.9% |
| a | 10039564 | 6.9% |
| d | 5019782 | 3.4% |
| C | 5019782 | 3.4% |
| y | 5019782 | 3.4% |
| Other values (7) | 35138474 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 25 |
| Min length | 25 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | https://ror.org/0566bfb96 |
|---|---|
| 2nd row | https://ror.org/0566bfb96 |
| 3rd row | https://ror.org/0566bfb96 |
| 4th row | https://ror.org/0566bfb96 |
| 5th row | https://ror.org/0566bfb96 |
| Value | Count | Frequency (%) |
| https://ror.org/0566bfb96 | 5019782 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 15059346 | |
| r | 15059346 | |
| 6 | 15059346 | |
| t | 10039564 | 8.0% |
| o | 10039564 | 8.0% |
| b | 10039564 | 8.0% |
| h | 5019782 | 4.0% |
| p | 5019782 | 4.0% |
| s | 5019782 | 4.0% |
| : | 5019782 | 4.0% |
| Other values (6) | 30118692 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70276948 | |
| Decimal Number | 30118692 | |
| Other Punctuation | 25098910 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 15059346 | |
| t | 10039564 | |
| o | 10039564 | |
| b | 10039564 | |
| h | 5019782 | 7.1% |
| p | 5019782 | 7.1% |
| s | 5019782 | 7.1% |
| g | 5019782 | 7.1% |
| f | 5019782 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 15059346 | |
| 0 | 5019782 | 16.7% |
| 5 | 5019782 | 16.7% |
| 9 | 5019782 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 15059346 | |
| : | 5019782 | 20.0% |
| . | 5019782 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 70276948 | |
| Common | 55217602 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 15059346 | |
| t | 10039564 | |
| o | 10039564 | |
| b | 10039564 | |
| h | 5019782 | 7.1% |
| p | 5019782 | 7.1% |
| s | 5019782 | 7.1% |
| g | 5019782 | 7.1% |
| f | 5019782 | 7.1% |
Common
| Value | Count | Frequency (%) |
| / | 15059346 | |
| 6 | 15059346 | |
| : | 5019782 | 9.1% |
| . | 5019782 | 9.1% |
| 0 | 5019782 | 9.1% |
| 5 | 5019782 | 9.1% |
| 9 | 5019782 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125494550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 15059346 | |
| r | 15059346 | |
| 6 | 15059346 | |
| t | 10039564 | 8.0% |
| o | 10039564 | 8.0% |
| b | 10039564 | 8.0% |
| h | 5019782 | 4.0% |
| p | 5019782 | 4.0% |
| s | 5019782 | 4.0% |
| : | 5019782 | 4.0% |
| Other values (6) | 30118692 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Botany |
|---|---|
| 2nd row | Botany |
| 3rd row | Botany |
| 4th row | Botany |
| 5th row | Botany |
| Value | Count | Frequency (%) |
| botany | 5019782 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 5019782 | |
| o | 5019782 | |
| t | 5019782 | |
| a | 5019782 | |
| n | 5019782 | |
| y | 5019782 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25098910 | |
| Uppercase Letter | 5019782 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5019782 | |
| t | 5019782 | |
| a | 5019782 | |
| n | 5019782 | |
| y | 5019782 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 5019782 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30118692 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 5019782 | |
| o | 5019782 | |
| t | 5019782 | |
| a | 5019782 | |
| n | 5019782 | |
| y | 5019782 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30118692 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 5019782 | |
| o | 5019782 | |
| t | 5019782 | |
| a | 5019782 | |
| n | 5019782 | |
| y | 5019782 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 785 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 5018997 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 25094985 | |
| r | 10037994 | 11.8% |
| P | 5018997 | 5.9% |
| s | 5018997 | 5.9% |
| v | 5018997 | 5.9% |
| d | 5018997 | 5.9% |
| S | 5018997 | 5.9% |
| p | 5018997 | 5.9% |
| c | 5018997 | 5.9% |
| i | 5018997 | 5.9% |
| Other values (2) | 10037994 | 11.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75284955 | |
| Uppercase Letter | 10037994 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25094985 | |
| r | 10037994 | 13.3% |
| s | 5018997 | 6.7% |
| v | 5018997 | 6.7% |
| d | 5018997 | 6.7% |
| p | 5018997 | 6.7% |
| c | 5018997 | 6.7% |
| i | 5018997 | 6.7% |
| m | 5018997 | 6.7% |
| n | 5018997 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 5018997 | |
| S | 5018997 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85322949 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 25094985 | |
| r | 10037994 | 11.8% |
| P | 5018997 | 5.9% |
| s | 5018997 | 5.9% |
| v | 5018997 | 5.9% |
| d | 5018997 | 5.9% |
| S | 5018997 | 5.9% |
| p | 5018997 | 5.9% |
| c | 5018997 | 5.9% |
| i | 5018997 | 5.9% |
| Other values (2) | 10037994 | 11.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85322949 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 25094985 | |
| r | 10037994 | 11.8% |
| P | 5018997 | 5.9% |
| s | 5018997 | 5.9% |
| v | 5018997 | 5.9% |
| d | 5018997 | 5.9% |
| S | 5018997 | 5.9% |
| p | 5018997 | 5.9% |
| c | 5018997 | 5.9% |
| i | 5018997 | 5.9% |
| Other values (2) | 10037994 | 11.8% |
occurrenceID
Text
Unique 
| Distinct | 5019782 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 81 |
|---|---|
| Median length | 61 |
| Mean length | 61.70256258 |
| Min length | 58 |
Unique
| Unique | 5019782 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851604 |
|---|---|
| 2nd row | https://data.biodiversitydata.nl/naturalis/specimen/L%20%200971472 |
| 3rd row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851644 |
| 4th row | https://data.biodiversitydata.nl/naturalis/specimen/L%20%200971531 |
| 5th row | https://data.biodiversitydata.nl/naturalis/specimen/L.2851686 |
| Value | Count | Frequency (%) |
| https://data.biodiversitydata.nl/naturalis/specimen/wag.1226003 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/wag.1816421 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/wag0454007 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.4308389 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/wag0100360 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200981551 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l%20%200820195 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/wag.1250897 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.4434831 | 2 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/l.4373010 | 2 | < 0.1% |
| Other values (5019737) | 5019762 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 30118729 | 9.7% |
| t | 30118693 | 9.7% |
| / | 25098912 | 8.1% |
| i | 25098910 | 8.1% |
| s | 20079128 | 6.5% |
| n | 15059347 | 4.9% |
| e | 15059347 | 4.9% |
| d | 15059346 | 4.9% |
| . | 14670101 | 4.7% |
| l | 10039580 | 3.2% |
| Other values (55) | 109331320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 220870604 | |
| Other Punctuation | 45484356 | 14.7% |
| Decimal Number | 36335937 | 11.7% |
| Uppercase Letter | 7042491 | 2.3% |
| Connector Punctuation | 13 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3368280 | |
| A | 1011325 | 14.4% |
| G | 896119 | 12.7% |
| W | 896053 | 12.7% |
| U | 640126 | 9.1% |
| M | 115226 | 1.6% |
| D | 115202 | 1.6% |
| N | 21 | < 0.1% |
| P | 21 | < 0.1% |
| S | 21 | < 0.1% |
| Other values (13) | 97 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30118729 | |
| t | 30118693 | |
| i | 25098910 | |
| s | 20079128 | |
| n | 15059347 | 6.8% |
| e | 15059347 | 6.8% |
| d | 15059346 | 6.8% |
| l | 10039580 | 4.5% |
| r | 10039564 | 4.5% |
| p | 10039564 | 4.5% |
| Other values (10) | 40158396 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5550526 | |
| 2 | 4766114 | |
| 0 | 4160945 | |
| 3 | 3919832 | |
| 4 | 3413591 | |
| 7 | 2998891 | |
| 5 | 2992564 | |
| 6 | 2885873 | |
| 9 | 2833494 | |
| 8 | 2814107 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 25098912 | |
| . | 14670101 | |
| : | 5019783 | 11.0% |
| % | 695511 | 1.5% |
| ! | 47 | < 0.1% |
| ' | 1 | < 0.1% |
| @ | 1 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 227913095 | |
| Common | 81820318 | 26.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30118729 | |
| t | 30118693 | |
| i | 25098910 | |
| s | 20079128 | |
| n | 15059347 | 6.6% |
| e | 15059347 | 6.6% |
| d | 15059346 | 6.6% |
| l | 10039580 | 4.4% |
| r | 10039564 | 4.4% |
| p | 10039564 | 4.4% |
| Other values (33) | 47200887 |
Common
| Value | Count | Frequency (%) |
| / | 25098912 | |
| . | 14670101 | |
| 1 | 5550526 | 6.8% |
| : | 5019783 | 6.1% |
| 2 | 4766114 | 5.8% |
| 0 | 4160945 | 5.1% |
| 3 | 3919832 | 4.8% |
| 4 | 3413591 | 4.2% |
| 7 | 2998891 | 3.7% |
| 5 | 2992564 | 3.7% |
| Other values (12) | 9229059 | 11.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 309733413 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 30118729 | 9.7% |
| t | 30118693 | 9.7% |
| / | 25098912 | 8.1% |
| i | 25098910 | 8.1% |
| s | 20079128 | 6.5% |
| n | 15059347 | 4.9% |
| e | 15059347 | 4.9% |
| d | 15059346 | 4.9% |
| . | 14670101 | 4.7% |
| l | 10039580 | 3.2% |
| Other values (55) | 109331320 |
catalogNumber
Text
Unique 
| Distinct | 5019782 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 9.425454532 |
| Min length | 6 |
Unique
| Unique | 5019782 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | L.2851604 |
|---|---|
| 2nd row | L 0971472 |
| 3rd row | L.2851644 |
| 4th row | L 0971531 |
| 5th row | L.2851686 |
| Value | Count | Frequency (%) |
| l | 285081 | 5.3% |
| u | 62704 | 1.2% |
| 04 | 7 | < 0.1% |
| 0012538 | 3 | < 0.1% |
| 3 | < 0.1% | |
| 0228872 | 2 | < 0.1% |
| 0004574 | 2 | < 0.1% |
| 0229129 | 2 | < 0.1% |
| 0004635 | 2 | < 0.1% |
| 0256210 | 2 | < 0.1% |
| Other values (4994407) | 5019794 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5550526 | |
| . | 4630537 | |
| 2 | 4070617 | |
| 3 | 3919831 | 8.3% |
| 0 | 3465436 | 7.3% |
| 4 | 3413591 | 7.2% |
| L | 3368280 | 7.1% |
| 7 | 2998891 | 6.3% |
| 5 | 2992563 | 6.3% |
| 6 | 2885861 | 6.1% |
| Other values (48) | 10017594 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34944917 | |
| Uppercase Letter | 7042489 | 14.9% |
| Other Punctuation | 4630591 | 9.8% |
| Space Separator | 695497 | 1.5% |
| Lowercase Letter | 196 | < 0.1% |
| Connector Punctuation | 13 | < 0.1% |
| Modifier Symbol | 12 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3368280 | |
| A | 1011325 | 14.4% |
| G | 896119 | 12.7% |
| W | 896053 | 12.7% |
| U | 640126 | 9.1% |
| M | 115226 | 1.6% |
| D | 115202 | 1.6% |
| P | 21 | < 0.1% |
| S | 21 | < 0.1% |
| N | 21 | < 0.1% |
| Other values (13) | 95 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5550526 | |
| 2 | 4070617 | |
| 3 | 3919831 | |
| 0 | 3465436 | |
| 4 | 3413591 | |
| 7 | 2998891 | |
| 5 | 2992563 | |
| 6 | 2885861 | |
| 9 | 2833494 | |
| 8 | 2814107 |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 93 | |
| g | 43 | |
| a | 37 | 18.9% |
| l | 16 | 8.2% |
| o | 2 | 1.0% |
| u | 1 | 0.5% |
| e | 1 | 0.5% |
| t | 1 | 0.5% |
| n | 1 | 0.5% |
| v | 1 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4630537 | |
| ! | 47 | < 0.1% |
| / | 2 | < 0.1% |
| ' | 1 | < 0.1% |
| : | 1 | < 0.1% |
| \ | 1 | < 0.1% |
| @ | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 695497 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 12 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40271042 | |
| Latin | 7042685 | 14.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 3368280 | |
| A | 1011325 | 14.4% |
| G | 896119 | 12.7% |
| W | 896053 | 12.7% |
| U | 640126 | 9.1% |
| M | 115226 | 1.6% |
| D | 115202 | 1.6% |
| w | 93 | < 0.1% |
| g | 43 | < 0.1% |
| a | 37 | < 0.1% |
| Other values (23) | 181 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 | 5550526 | |
| . | 4630537 | |
| 2 | 4070617 | |
| 3 | 3919831 | |
| 0 | 3465436 | |
| 4 | 3413591 | |
| 7 | 2998891 | |
| 5 | 2992563 | |
| 6 | 2885861 | |
| 9 | 2833494 | |
| Other values (15) | 3509695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47313727 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5550526 | |
| . | 4630537 | |
| 2 | 4070617 | |
| 3 | 3919831 | 8.3% |
| 0 | 3465436 | 7.3% |
| 4 | 3413591 | 7.2% |
| L | 3368280 | 7.1% |
| 7 | 2998891 | 6.3% |
| 5 | 2992563 | 6.3% |
| 6 | 2885861 | 6.1% |
| Other values (48) | 10017594 |
recordNumber
Text
| Distinct | 2852768 |
|---|---|
| Distinct (%) | 56.8% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 121 |
|---|---|
| Median length | 104 |
| Mean length | 21.23713803 |
| Min length | 1 |
Unique
| Unique | 2358777 ? |
|---|---|
| Unique (%) | 47.0% |
Sample
| 1st row | Unknown s.n. |
|---|---|
| 2nd row | Zainoeddin bb 17357 |
| 3rd row | Wijk, JH van s.n. |
| 4th row | Unknown bb 17412 |
| 5th row | Koster, JT 6255 |
| Value | Count | Frequency (%) |
| s.n | 1517120 | 7.6% |
| unknown | 403748 | 2.0% |
| van | 402082 | 2.0% |
| de | 306350 | 1.5% |
| a | 267054 | 1.3% |
| j | 265883 | 1.3% |
| m | 160895 | 0.8% |
| h | 141882 | 0.7% |
| p | 138822 | 0.7% |
| c | 138103 | 0.7% |
| Other values (172568) | 16227490 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14949623 | 14.0% | |
| n | 6590984 | 6.2% |
| e | 6135300 | 5.8% |
| , | 5592121 | 5.2% |
| a | 4397488 | 4.1% |
| s | 3844877 | 3.6% |
| r | 3532381 | 3.3% |
| o | 3476858 | 3.3% |
| . | 3240362 | 3.0% |
| i | 2918794 | 2.7% |
| Other values (129) | 51926994 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47775564 | |
| Uppercase Letter | 18878006 | 17.7% |
| Space Separator | 14949650 | 14.0% |
| Decimal Number | 13997887 | 13.1% |
| Other Punctuation | 10496024 | 9.8% |
| Dash Punctuation | 387395 | 0.4% |
| Open Punctuation | 59284 | 0.1% |
| Close Punctuation | 59257 | 0.1% |
| Connector Punctuation | 2158 | < 0.1% |
| Math Symbol | 316 | < 0.1% |
| Other values (6) | 241 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 6590984 | |
| e | 6135300 | |
| a | 4397488 | |
| s | 3844877 | 8.0% |
| r | 3532381 | 7.4% |
| o | 3476858 | 7.3% |
| i | 2918794 | 6.1% |
| l | 2230606 | 4.7% |
| t | 2053581 | 4.3% |
| d | 1674628 | 3.5% |
| Other values (46) | 10920067 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 1947721 | 10.3% |
| H | 1300682 | 6.9% |
| A | 1296584 | 6.9% |
| S | 1281050 | 6.8% |
| B | 1243337 | 6.6% |
| M | 1192210 | 6.3% |
| C | 1016865 | 5.4% |
| W | 948390 | 5.0% |
| P | 926499 | 4.9% |
| R | 844491 | 4.5% |
| Other values (28) | 6880177 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5592121 | |
| . | 3240362 | |
| ; | 1574515 | 15.0% |
| / | 48042 | 0.5% |
| ' | 30407 | 0.3% |
| ! | 6979 | 0.1% |
| : | 2233 | < 0.1% |
| ? | 666 | < 0.1% |
| \ | 277 | < 0.1% |
| & | 192 | < 0.1% |
| Other values (6) | 230 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2197925 | |
| 2 | 1669670 | |
| 3 | 1475003 | |
| 4 | 1361893 | |
| 5 | 1307307 | |
| 6 | 1259786 | |
| 7 | 1213233 | |
| 0 | 1176749 | |
| 8 | 1175255 | |
| 9 | 1161066 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 57600 | |
| [ | 1683 | 2.8% |
| { | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 14949623 | ||
| 27 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 57575 | |
| ] | 1682 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 286 | |
| = | 30 | 9.5% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 81 | |
| ¼ | 4 | 4.7% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 6 | |
| º | 2 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 387395 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2158 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 145 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66653578 | |
| Common | 39952204 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 6590984 | 9.9% |
| e | 6135300 | 9.2% |
| a | 4397488 | 6.6% |
| s | 3844877 | 5.8% |
| r | 3532381 | 5.3% |
| o | 3476858 | 5.2% |
| i | 2918794 | 4.4% |
| l | 2230606 | 3.3% |
| t | 2053581 | 3.1% |
| J | 1947721 | 2.9% |
| Other values (86) | 29524988 |
Common
| Value | Count | Frequency (%) |
| 14949623 | ||
| , | 5592121 | 14.0% |
| . | 3240362 | 8.1% |
| 1 | 2197925 | 5.5% |
| 2 | 1669670 | 4.2% |
| ; | 1574515 | 3.9% |
| 3 | 1475003 | 3.7% |
| 4 | 1361893 | 3.4% |
| 5 | 1307307 | 3.3% |
| 6 | 1259786 | 3.2% |
| Other values (33) | 5323999 | 13.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106364932 | |
| None | 240848 | 0.2% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14949623 | 14.1% | |
| n | 6590984 | 6.2% |
| e | 6135300 | 5.8% |
| , | 5592121 | 5.3% |
| a | 4397488 | 4.1% |
| s | 3844877 | 3.6% |
| r | 3532381 | 3.3% |
| o | 3476858 | 3.3% |
| . | 3240362 | 3.0% |
| i | 2918794 | 2.7% |
| Other values (77) | 51686144 |
None
| Value | Count | Frequency (%) |
| é | 66202 | |
| ü | 43282 | |
| ö | 22422 | 9.3% |
| á | 20043 | 8.3% |
| è | 16968 | 7.0% |
| í | 12114 | 5.0% |
| ñ | 10827 | 4.5% |
| ó | 8648 | 3.6% |
| ß | 8310 | 3.5% |
| ë | 5441 | 2.3% |
| Other values (40) | 26591 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 | |
| ” | 1 |
recordedBy
Text
| Distinct | 101508 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 11448 |
| Missing (%) | 0.2% |
| Memory size | 38.3 MiB |
Length
| Max length | 108 |
|---|---|
| Median length | 96 |
| Mean length | 14.60004524 |
| Min length | 1 |
Unique
| Unique | 37149 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Zainoeddin |
| 3rd row | Wijk JH van |
| 4th row | Unknown |
| 5th row | Koster JT |
| Value | Count | Frequency (%) |
| unknown | 403744 | 2.9% |
| van | 402079 | 2.9% |
| de | 306325 | 2.2% |
| j | 264889 | 1.9% |
| a | 210805 | 1.5% |
| m | 155481 | 1.1% |
| al | 137949 | 1.0% |
| h | 137879 | 1.0% |
| r | 135106 | 1.0% |
| p | 133204 | 0.9% |
| Other values (40914) | 11811523 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9090878 | 12.4% | |
| e | 6123944 | 8.4% |
| n | 5043429 | 6.9% |
| a | 4353925 | 6.0% |
| r | 3514483 | 4.8% |
| o | 3466711 | 4.7% |
| i | 2903594 | 4.0% |
| s | 2322445 | 3.2% |
| l | 2220840 | 3.0% |
| t | 2048480 | 2.8% |
| Other values (115) | 32033174 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44430714 | |
| Uppercase Letter | 17477628 | 23.9% |
| Space Separator | 9090903 | 12.4% |
| Other Punctuation | 1761774 | 2.4% |
| Dash Punctuation | 261926 | 0.4% |
| Decimal Number | 45250 | 0.1% |
| Open Punctuation | 25796 | < 0.1% |
| Close Punctuation | 25788 | < 0.1% |
| Connector Punctuation | 2002 | < 0.1% |
| Math Symbol | 119 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6123944 | |
| n | 5043429 | |
| a | 4353925 | |
| r | 3514483 | 7.9% |
| o | 3466711 | 7.8% |
| i | 2903594 | 6.5% |
| s | 2322445 | 5.2% |
| l | 2220840 | 5.0% |
| t | 2048480 | 4.6% |
| d | 1649098 | 3.7% |
| Other values (46) | 10783765 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 1943668 | 11.1% |
| H | 1259497 | 7.2% |
| M | 1172359 | 6.7% |
| A | 1164819 | 6.7% |
| B | 1112442 | 6.4% |
| S | 1108388 | 6.3% |
| C | 988237 | 5.7% |
| W | 913411 | 5.2% |
| R | 795966 | 4.6% |
| P | 782981 | 4.5% |
| Other values (28) | 6235860 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1574505 | |
| . | 156410 | 8.9% |
| ' | 29989 | 1.7% |
| ? | 412 | < 0.1% |
| / | 271 | < 0.1% |
| & | 104 | < 0.1% |
| ! | 49 | < 0.1% |
| ¡ | 30 | < 0.1% |
| : | 3 | < 0.1% |
| … | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9720 | |
| 9 | 8640 | |
| 6 | 5668 | |
| 7 | 5037 | |
| 4 | 4020 | |
| 8 | 3781 | 8.4% |
| 0 | 2227 | 4.9% |
| 3 | 2127 | 4.7% |
| 2 | 2056 | 4.5% |
| 5 | 1974 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| 9090878 | ||
| 25 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 100 | |
| = | 19 | 16.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 261926 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25796 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25788 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2002 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61908342 | |
| Common | 11213561 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6123944 | 9.9% |
| n | 5043429 | 8.1% |
| a | 4353925 | 7.0% |
| r | 3514483 | 5.7% |
| o | 3466711 | 5.6% |
| i | 2903594 | 4.7% |
| s | 2322445 | 3.8% |
| l | 2220840 | 3.6% |
| t | 2048480 | 3.3% |
| J | 1943668 | 3.1% |
| Other values (84) | 27966823 |
Common
| Value | Count | Frequency (%) |
| 9090878 | ||
| ; | 1574505 | 14.0% |
| - | 261926 | 2.3% |
| . | 156410 | 1.4% |
| ' | 29989 | 0.3% |
| ( | 25796 | 0.2% |
| ) | 25788 | 0.2% |
| 1 | 9720 | 0.1% |
| 9 | 8640 | 0.1% |
| 6 | 5668 | 0.1% |
| Other values (21) | 24241 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72889532 | |
| None | 232369 | 0.3% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9090878 | 12.5% | |
| e | 6123944 | 8.4% |
| n | 5043429 | 6.9% |
| a | 4353925 | 6.0% |
| r | 3514483 | 4.8% |
| o | 3466711 | 4.8% |
| i | 2903594 | 4.0% |
| s | 2322445 | 3.2% |
| l | 2220840 | 3.0% |
| t | 2048480 | 2.8% |
| Other values (68) | 31800803 |
None
| Value | Count | Frequency (%) |
| é | 66202 | |
| ü | 43282 | |
| ö | 22417 | 9.6% |
| á | 20043 | 8.6% |
| è | 16968 | 7.3% |
| í | 12114 | 5.2% |
| ñ | 10827 | 4.7% |
| ó | 8648 | 3.7% |
| ë | 5441 | 2.3% |
| ä | 4932 | 2.1% |
| Other values (35) | 21495 | 9.3% |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 | |
| ” | 1 |
Missing 
| Distinct | 1247855 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 3742862 |
| Missing (%) | 74.6% |
| Memory size | 38.3 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 10 |
| Mean length | 11.00010729 |
| Min length | 1 |
Unique
| Unique | 1228633 ? |
|---|---|
| Unique (%) | 96.2% |
Sample
| 1st row | L 0215467 |
|---|---|
| 2nd row | L 0215532 |
| 3rd row | L 0204325 |
| 4th row | L 0542724 |
| 5th row | L 0973113 |
| Value | Count | Frequency (%) |
| l | 605823 | |
| 176059 | 7.3% | |
| u | 146244 | 6.1% |
| uw | 6074 | 0.3% |
| b | 4013 | 0.2% |
| a | 2407 | 0.1% |
| 0 | 681 | < 0.1% |
| k | 377 | < 0.1% |
| jan.99 | 305 | < 0.1% |
| okt.00 | 265 | < 0.1% |
| Other values (1309143) | 1457050 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2090566 | |
| 1874427 | ||
| 1 | 1087243 | 7.7% |
| 2 | 1003157 | 7.1% |
| 3 | 926296 | 6.6% |
| 9 | 902389 | 6.4% |
| 4 | 845700 | 6.0% |
| 5 | 836296 | 6.0% |
| 8 | 818449 | 5.8% |
| 6 | 793280 | 5.6% |
| Other values (69) | 2868454 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10037815 | |
| Space Separator | 1874427 | 13.3% |
| Uppercase Letter | 1854729 | 13.2% |
| Math Symbol | 176021 | 1.3% |
| Lowercase Letter | 95261 | 0.7% |
| Other Punctuation | 7322 | 0.1% |
| Dash Punctuation | 645 | < 0.1% |
| Modifier Symbol | 30 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 606318 | |
| A | 335894 | |
| W | 323267 | |
| G | 322647 | |
| U | 196893 | 10.6% |
| D | 15749 | 0.8% |
| M | 9666 | 0.5% |
| F | 6715 | 0.4% |
| O | 5973 | 0.3% |
| H | 5717 | 0.3% |
| Other values (16) | 25890 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 67201 | |
| e | 2900 | 3.0% |
| u | 2872 | 3.0% |
| i | 2400 | 2.5% |
| a | 2331 | 2.4% |
| n | 1916 | 2.0% |
| j | 1796 | 1.9% |
| p | 1596 | 1.7% |
| t | 1593 | 1.7% |
| l | 1545 | 1.6% |
| Other values (15) | 9111 | 9.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2090566 | |
| 1 | 1087243 | |
| 2 | 1003157 | |
| 3 | 926296 | |
| 9 | 902389 | |
| 4 | 845700 | |
| 5 | 836296 | |
| 8 | 818449 | 8.2% |
| 6 | 793280 | 7.9% |
| 7 | 734439 | 7.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6475 | |
| : | 671 | 9.2% |
| / | 151 | 2.1% |
| ? | 7 | 0.1% |
| ! | 5 | 0.1% |
| , | 4 | 0.1% |
| * | 4 | 0.1% |
| \ | 3 | < 0.1% |
| ' | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 176015 | |
| + | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 | |
| ) | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1874427 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 645 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 30 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12096267 | |
| Latin | 1949990 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 606318 | |
| A | 335894 | |
| W | 323267 | |
| G | 322647 | |
| U | 196893 | 10.1% |
| w | 67201 | 3.4% |
| D | 15749 | 0.8% |
| M | 9666 | 0.5% |
| F | 6715 | 0.3% |
| O | 5973 | 0.3% |
| Other values (41) | 59667 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 2090566 | |
| 1874427 | ||
| 1 | 1087243 | |
| 2 | 1003157 | |
| 3 | 926296 | |
| 9 | 902389 | |
| 4 | 845700 | |
| 5 | 836296 | |
| 8 | 818449 | 6.8% |
| 6 | 793280 | 6.6% |
| Other values (18) | 918464 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14046257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2090566 | |
| 1874427 | ||
| 1 | 1087243 | 7.7% |
| 2 | 1003157 | 7.1% |
| 3 | 926296 | 6.6% |
| 9 | 902389 | 6.4% |
| 4 | 845700 | 6.0% |
| 5 | 836296 | 6.0% |
| 8 | 818449 | 5.8% |
| 6 | 793280 | 5.6% |
| Other values (69) | 2868454 |
eventDate
Text
Missing 
| Distinct | 67961 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 856201 |
| Missing (%) | 17.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 11.80088967 |
| Min length | 10 |
Unique
| Unique | 6056 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1933-04-24 |
|---|---|
| 2nd row | 1956-05-14 |
| 3rd row | 1939-05-21 |
| 4th row | 1955-04-26 |
| 5th row | 1838-05-01/1838-05-31 |
| Value | Count | Frequency (%) |
| 1859-01-01/1859-12-31 | 5064 | 0.1% |
| 1857-01-01/1857-12-31 | 3575 | 0.1% |
| 1898-01-01/1898-12-31 | 3352 | 0.1% |
| 1922-10-01/1922-10-31 | 2927 | 0.1% |
| 1912-01-01/1912-12-31 | 2915 | 0.1% |
| 1840-01-01/1840-12-31 | 2864 | 0.1% |
| 1880-01-01/1880-12-31 | 2677 | 0.1% |
| 1893-01-01/1893-12-31 | 2625 | 0.1% |
| 1909-01-01/1909-12-31 | 2617 | 0.1% |
| 1900-01-01/1900-12-31 | 2597 | 0.1% |
| Other values (67951) | 4132368 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 9868786 | |
| - | 9690462 | |
| 0 | 7477809 | |
| 9 | 5666608 | |
| 2 | 3122979 | 6.4% |
| 8 | 2608202 | 5.3% |
| 3 | 2323651 | 4.7% |
| 6 | 2166097 | 4.4% |
| 7 | 2156107 | 4.4% |
| 5 | 1905253 | 3.9% |
| Other values (2) | 2148006 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 38761848 | |
| Dash Punctuation | 9690462 | 19.7% |
| Other Punctuation | 681650 | 1.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9868786 | |
| 0 | 7477809 | |
| 9 | 5666608 | |
| 2 | 3122979 | 8.1% |
| 8 | 2608202 | 6.7% |
| 3 | 2323651 | 6.0% |
| 6 | 2166097 | 5.6% |
| 7 | 2156107 | 5.6% |
| 5 | 1905253 | 4.9% |
| 4 | 1466356 | 3.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9690462 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 681650 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49133960 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 9868786 | |
| - | 9690462 | |
| 0 | 7477809 | |
| 9 | 5666608 | |
| 2 | 3122979 | 6.4% |
| 8 | 2608202 | 5.3% |
| 3 | 2323651 | 4.7% |
| 6 | 2166097 | 4.4% |
| 7 | 2156107 | 4.4% |
| 5 | 1905253 | 3.9% |
| Other values (2) | 2148006 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49133960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 9868786 | |
| - | 9690462 | |
| 0 | 7477809 | |
| 9 | 5666608 | |
| 2 | 3122979 | 6.4% |
| 8 | 2608202 | 5.3% |
| 3 | 2323651 | 4.7% |
| 6 | 2166097 | 4.4% |
| 7 | 2156107 | 4.4% |
| 5 | 1905253 | 3.9% |
| Other values (2) | 2148006 | 4.4% |
habitat
Text
Missing 
| Distinct | 339001 |
|---|---|
| Distinct (%) | 37.2% |
| Missing | 4109448 |
| Missing (%) | 81.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 24282 |
|---|---|
| Median length | 668 |
| Mean length | 39.6989929 |
| Min length | 1 |
Unique
| Unique | 233613 ? |
|---|---|
| Unique (%) | 25.7% |
Sample
| 1st row | Old forest |
|---|---|
| 2nd row | Old forest Very scanty |
| 3rd row | Old forest, steep ridge |
| 4th row | Old forest, clayey soil, sloping country, scanty |
| 5th row | Degrade forest |
| Value | Count | Frequency (%) |
| forest | 416697 | 7.7% |
| in | 196774 | 3.7% |
| on | 168551 | 3.1% |
| of | 89312 | 1.7% |
| soil | 89138 | 1.7% |
| primary | 81686 | 1.5% |
| with | 73747 | 1.4% |
| secondary | 71636 | 1.3% |
| the | 66144 | 1.2% |
| and | 64092 | 1.2% |
| Other values (93782) | 4061990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4478314 | 12.4% | |
| e | 3514056 | 9.7% |
| r | 2590696 | 7.2% |
| a | 2510661 | 6.9% |
| o | 2468359 | 6.8% |
| n | 2068334 | 5.7% |
| s | 1995921 | 5.5% |
| t | 1847880 | 5.1% |
| i | 1812691 | 5.0% |
| d | 1411076 | 3.9% |
| Other values (158) | 11441355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28588916 | |
| Space Separator | 4478314 | 12.4% |
| Other Punctuation | 1470486 | 4.1% |
| Uppercase Letter | 1324005 | 3.7% |
| Decimal Number | 116345 | 0.3% |
| Dash Punctuation | 77829 | 0.2% |
| Open Punctuation | 30001 | 0.1% |
| Close Punctuation | 29898 | 0.1% |
| Control | 12008 | < 0.1% |
| Math Symbol | 10534 | < 0.1% |
| Other values (8) | 1007 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3514056 | |
| r | 2590696 | 9.1% |
| a | 2510661 | 8.8% |
| o | 2468359 | 8.6% |
| n | 2068334 | 7.2% |
| s | 1995921 | 7.0% |
| t | 1847880 | 6.5% |
| i | 1812691 | 6.3% |
| d | 1411076 | 4.9% |
| l | 1409280 | 4.9% |
| Other values (47) | 6959962 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 172219 | |
| O | 129329 | 9.8% |
| P | 103042 | 7.8% |
| F | 82210 | 6.2% |
| I | 81943 | 6.2% |
| R | 75431 | 5.7% |
| A | 72320 | 5.5% |
| D | 69603 | 5.3% |
| C | 68369 | 5.2% |
| M | 67914 | 5.1% |
| Other values (34) | 401625 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 943776 | |
| , | 401980 | |
| ; | 77327 | 5.3% |
| ' | 17235 | 1.2% |
| / | 11370 | 0.8% |
| : | 7625 | 0.5% |
| & | 3770 | 0.3% |
| ? | 3374 | 0.2% |
| " | 2368 | 0.2% |
| % | 914 | 0.1% |
| Other values (9) | 747 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 31850 | |
| 1 | 18139 | |
| 5 | 15491 | |
| 2 | 14398 | |
| 3 | 10095 | 8.7% |
| 4 | 8421 | 7.2% |
| 9 | 4806 | 4.1% |
| 6 | 4747 | 4.1% |
| 8 | 4630 | 4.0% |
| 7 | 3768 | 3.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 8421 | |
| ± | 961 | 9.1% |
| = | 459 | 4.4% |
| > | 261 | 2.5% |
| < | 223 | 2.1% |
| | | 157 | 1.5% |
| ~ | 47 | 0.4% |
| × | 4 | < 0.1% |
| ÷ | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 27667 | |
| [ | 2321 | 7.7% |
| { | 8 | < 0.1% |
| ‚ | 5 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 26 | |
| ´ | 8 | 20.5% |
| ^ | 4 | 10.3% |
| ¨ | 1 | 2.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 27591 | |
| ] | 2299 | 7.7% |
| } | 8 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 148 | |
| ² | 60 | |
| ¼ | 2 | 1.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 17 | |
| ’ | 1 | 5.3% |
| » | 1 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 77753 | |
| – | 76 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 11945 | ||
| 63 | 0.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 17 | |
| « | 1 | 5.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 3 | |
| ¢ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 4478314 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 612 |
Other Letter
| Value | Count | Frequency (%) |
| º | 57 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29912953 | |
| Common | 6226390 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3514056 | |
| r | 2590696 | 8.7% |
| a | 2510661 | 8.4% |
| o | 2468359 | 8.3% |
| n | 2068334 | 6.9% |
| s | 1995921 | 6.7% |
| t | 1847880 | 6.2% |
| i | 1812691 | 6.1% |
| d | 1411076 | 4.7% |
| l | 1409280 | 4.7% |
| Other values (91) | 8283999 |
Common
| Value | Count | Frequency (%) |
| 4478314 | ||
| . | 943776 | 15.2% |
| , | 401980 | 6.5% |
| - | 77753 | 1.2% |
| ; | 77327 | 1.2% |
| 0 | 31850 | 0.5% |
| ( | 27667 | 0.4% |
| ) | 27591 | 0.4% |
| 1 | 18139 | 0.3% |
| ' | 17235 | 0.3% |
| Other values (57) | 124758 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36039670 | |
| None | 99549 | 0.3% |
| Punctuation | 124 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4478314 | 12.4% | |
| e | 3514056 | 9.8% |
| r | 2590696 | 7.2% |
| a | 2510661 | 7.0% |
| o | 2468359 | 6.8% |
| n | 2068334 | 5.7% |
| s | 1995921 | 5.5% |
| t | 1847880 | 5.1% |
| i | 1812691 | 5.0% |
| d | 1411076 | 3.9% |
| Other values (86) | 11341682 |
None
| Value | Count | Frequency (%) |
| é | 30664 | |
| ê | 27028 | |
| è | 14313 | |
| à | 7116 | 7.1% |
| á | 3678 | 3.7% |
| ä | 2477 | 2.5% |
| í | 1621 | 1.6% |
| ü | 1613 | 1.6% |
| ú | 1252 | 1.3% |
| ó | 1130 | 1.1% |
| Other values (56) | 8657 | 8.7% |
Punctuation
| Value | Count | Frequency (%) |
| – | 76 | |
| “ | 17 | 13.7% |
| ” | 17 | 13.7% |
| † | 8 | 6.5% |
| ‚ | 5 | 4.0% |
| ’ | 1 | 0.8% |
samplingEffort
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.0 m |
|---|
| Value | Count | Frequency (%) |
| 0.0 | 1 | |
| m | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 | ||
| m | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | |
| Other Punctuation | 1 | |
| Space Separator | 1 | |
| Lowercase Letter | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 | |
| Latin | 1 | 20.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 |
Latin
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 | |
| 1 | ||
| m | 1 |
continent
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 902365 |
| Missing (%) | 18.0% |
| Memory size | 38.3 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 7.327123048 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Europe |
|---|---|
| 2nd row | Asia |
| 3rd row | Europe |
| 4th row | Asia |
| 5th row | Europe |
| Value | Count | Frequency (%) |
| asia | 1235811 | |
| europe | 1145221 | |
| africa | 713929 | |
| america | 661234 | |
| southern | 417866 | 8.7% |
| australasia | 358600 | 7.5% |
| central | 124578 | 2.6% |
| north | 118790 | 2.5% |
| antarctica | 1561 | < 0.1% |
| africa/asia | 1061 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3816596 | |
| r | 3542840 | |
| A | 2973257 | |
| i | 2973257 | |
| e | 2348899 | 7.8% |
| s | 1954072 | 6.5% |
| u | 1921687 | 6.4% |
| o | 1681877 | 5.6% |
| c | 1379346 | 4.6% |
| p | 1145221 | 3.8% |
| Other values (12) | 6431769 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24726814 | |
| Uppercase Letter | 4779712 | 15.8% |
| Space Separator | 661234 | 2.2% |
| Other Punctuation | 1061 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3816596 | |
| r | 3542840 | |
| i | 2973257 | |
| e | 2348899 | |
| s | 1954072 | |
| u | 1921687 | |
| o | 1681877 | |
| c | 1379346 | 5.6% |
| p | 1145221 | 4.6% |
| t | 1022956 | 4.1% |
| Other values (5) | 2940063 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2973257 | |
| E | 1145221 | 24.0% |
| S | 417866 | 8.7% |
| C | 124578 | 2.6% |
| N | 118790 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 661234 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1061 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29506526 | |
| Common | 662295 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3816596 | |
| r | 3542840 | |
| A | 2973257 | |
| i | 2973257 | |
| e | 2348899 | |
| s | 1954072 | 6.6% |
| u | 1921687 | 6.5% |
| o | 1681877 | 5.7% |
| c | 1379346 | 4.7% |
| p | 1145221 | 3.9% |
| Other values (10) | 5769474 |
Common
| Value | Count | Frequency (%) |
| 661234 | ||
| / | 1061 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30168821 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3816596 | |
| r | 3542840 | |
| A | 2973257 | |
| i | 2973257 | |
| e | 2348899 | 7.8% |
| s | 1954072 | 6.5% |
| u | 1921687 | 6.4% |
| o | 1681877 | 5.6% |
| c | 1379346 | 4.6% |
| p | 1145221 | 3.8% |
| Other values (12) | 6431769 |
island
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 51.41942 |
|---|
| Value | Count | Frequency (%) |
| 51.41942 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 | |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| 9 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 | |
| 5 | 1 | |
| . | 1 | |
| 9 | 1 | |
| 2 | 1 |
country
Text
| Distinct | 259 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 375 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 33 |
| Mean length | 9.385069392 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | France |
|---|---|
| 2nd row | Indonesia |
| 3rd row | France |
| 4th row | Indonesia |
| 5th row | Greece |
| Value | Count | Frequency (%) |
| unknown | 901458 | 14.7% |
| netherlands | 668769 | 10.9% |
| indonesia | 566953 | 9.2% |
| new | 190960 | 3.1% |
| guinea | 166419 | 2.7% |
| papua | 152664 | 2.5% |
| brazil | 120044 | 2.0% |
| united | 119751 | 2.0% |
| france | 116205 | 1.9% |
| australia | 110396 | 1.8% |
| Other values (304) | 3021381 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 6455572 | |
| a | 6144718 | 13.0% |
| e | 3683911 | 7.8% |
| i | 3049965 | 6.5% |
| o | 2676245 | 5.7% |
| s | 2153854 | 4.6% |
| r | 2077292 | 4.4% |
| d | 1855222 | 3.9% |
| l | 1850248 | 3.9% |
| t | 1563534 | 3.3% |
| Other values (57) | 15596922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39272562 | |
| Uppercase Letter | 6271710 | 13.3% |
| Space Separator | 1115878 | 2.4% |
| Other Punctuation | 230177 | 0.5% |
| Close Punctuation | 106346 | 0.2% |
| Open Punctuation | 106346 | 0.2% |
| Dash Punctuation | 4458 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 6455572 | |
| a | 6144718 | |
| e | 3683911 | |
| i | 3049965 | 7.8% |
| o | 2676245 | 6.8% |
| s | 2153854 | 5.5% |
| r | 2077292 | 5.3% |
| d | 1855222 | 4.7% |
| l | 1850248 | 4.7% |
| t | 1563534 | 4.0% |
| Other values (18) | 7762001 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1043022 | |
| N | 915175 | |
| I | 762328 | |
| S | 624792 | |
| M | 401415 | 6.4% |
| G | 377149 | 6.0% |
| A | 373141 | 5.9% |
| C | 366717 | 5.8% |
| P | 335466 | 5.3% |
| B | 239678 | 3.8% |
| Other values (14) | 832827 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 3 | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 8 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 212914 | |
| , | 11185 | 4.9% |
| & | 6077 | 2.6% |
| . | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 106092 | |
| ] | 254 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 106092 | |
| [ | 254 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1115878 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4458 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45544272 | |
| Common | 1563211 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 6455572 | |
| a | 6144718 | |
| e | 3683911 | 8.1% |
| i | 3049965 | 6.7% |
| o | 2676245 | 5.9% |
| s | 2153854 | 4.7% |
| r | 2077292 | 4.6% |
| d | 1855222 | 4.1% |
| l | 1850248 | 4.1% |
| t | 1563534 | 3.4% |
| Other values (42) | 14033711 |
Common
| Value | Count | Frequency (%) |
| 1115878 | ||
| / | 212914 | 13.6% |
| ) | 106092 | 6.8% |
| ( | 106092 | 6.8% |
| , | 11185 | 0.7% |
| & | 6077 | 0.4% |
| - | 4458 | 0.3% |
| [ | 254 | < 0.1% |
| ] | 254 | < 0.1% |
| 7 | 2 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47095541 | |
| None | 11942 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 6455572 | |
| a | 6144718 | 13.0% |
| e | 3683911 | 7.8% |
| i | 3049965 | 6.5% |
| o | 2676245 | 5.7% |
| s | 2153854 | 4.6% |
| r | 2077292 | 4.4% |
| d | 1855222 | 3.9% |
| l | 1850248 | 3.9% |
| t | 1563534 | 3.3% |
| Other values (55) | 15584980 |
None
| Value | Count | Frequency (%) |
| ç | 9002 | |
| é | 2940 | 24.6% |
countryCode
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | WGS84 |
| Value | Count | Frequency (%) |
| wgs84 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 2 | |
| G | 2 | |
| S | 2 | |
| 8 | 2 | |
| 4 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6 | |
| Decimal Number | 4 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 2 | |
| G | 2 | |
| S | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 4 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 | |
| Common | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 2 | |
| G | 2 | |
| S | 2 |
Common
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 4 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 2 | |
| G | 2 | |
| S | 2 | |
| 8 | 2 | |
| 4 | 2 |
stateProvince
Text
Missing 
| Distinct | 3223 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3065004 |
| Missing (%) | 61.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 28 |
| Mean length | 8.864043385 |
| Min length | 3 |
Unique
| Unique | 408 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sumatra |
|---|---|
| 2nd row | Borneo |
| 3rd row | Borneo |
| 4th row | Sumatra |
| 5th row | Sumatra |
| Value | Count | Frequency (%) |
| borneo | 230539 | 9.3% |
| new | 206395 | 8.3% |
| guinea | 192672 | 7.8% |
| java | 135629 | 5.5% |
| sumatra | 84195 | 3.4% |
| region | 83893 | 3.4% |
| northern | 54146 | 2.2% |
| zuid-holland | 53025 | 2.1% |
| gelderland | 42250 | 1.7% |
| sulawesi | 38230 | 1.5% |
| Other values (3222) | 1356524 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1947747 | 11.2% |
| e | 1641332 | 9.5% |
| o | 1463019 | 8.4% |
| n | 1452254 | 8.4% |
| r | 1147603 | 6.6% |
| i | 879429 | 5.1% |
| u | 873773 | 5.0% |
| l | 667117 | 3.9% |
| t | 638970 | 3.7% |
| s | 549587 | 3.2% |
| Other values (98) | 6066406 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13786801 | |
| Uppercase Letter | 2714365 | 15.7% |
| Space Separator | 522745 | 3.0% |
| Dash Punctuation | 255700 | 1.5% |
| Open Punctuation | 19001 | 0.1% |
| Close Punctuation | 18788 | 0.1% |
| Other Punctuation | 9425 | 0.1% |
| Decimal Number | 236 | < 0.1% |
| Final Punctuation | 171 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1947747 | |
| e | 1641332 | |
| o | 1463019 | |
| n | 1452254 | |
| r | 1147603 | |
| i | 879429 | 6.4% |
| u | 873773 | 6.3% |
| l | 667117 | 4.8% |
| t | 638970 | 4.6% |
| s | 549587 | 4.0% |
| Other values (42) | 2525970 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 396728 | |
| S | 316558 | |
| B | 299278 | |
| G | 278749 | 10.3% |
| J | 140484 | 5.2% |
| M | 133425 | 4.9% |
| L | 124661 | 4.6% |
| H | 116582 | 4.3% |
| R | 104500 | 3.8% |
| C | 91711 | 3.4% |
| Other values (23) | 711689 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 56 | |
| 6 | 53 | |
| 4 | 49 | |
| 3 | 37 | |
| 5 | 17 | 7.2% |
| 8 | 15 | 6.4% |
| 2 | 6 | 2.5% |
| 1 | 3 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5631 | |
| ' | 2710 | |
| , | 1013 | 10.7% |
| & | 34 | 0.4% |
| ? | 20 | 0.2% |
| / | 17 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 522740 | ||
| 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 255531 | |
| – | 169 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 4 | |
| + | 1 | 20.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 19001 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 18788 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 171 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16501166 | |
| Common | 826071 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1947747 | 11.8% |
| e | 1641332 | 9.9% |
| o | 1463019 | 8.9% |
| n | 1452254 | 8.8% |
| r | 1147603 | 7.0% |
| i | 879429 | 5.3% |
| u | 873773 | 5.3% |
| l | 667117 | 4.0% |
| t | 638970 | 3.9% |
| s | 549587 | 3.3% |
| Other values (75) | 5240335 |
Common
| Value | Count | Frequency (%) |
| 522740 | ||
| - | 255531 | |
| ( | 19001 | 2.3% |
| ) | 18788 | 2.3% |
| . | 5631 | 0.7% |
| ' | 2710 | 0.3% |
| , | 1013 | 0.1% |
| ’ | 171 | < 0.1% |
| – | 169 | < 0.1% |
| 7 | 56 | < 0.1% |
| Other values (13) | 261 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17185130 | |
| None | 141767 | 0.8% |
| Punctuation | 340 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1947747 | 11.3% |
| e | 1641332 | 9.6% |
| o | 1463019 | 8.5% |
| n | 1452254 | 8.5% |
| r | 1147603 | 6.7% |
| i | 879429 | 5.1% |
| u | 873773 | 5.1% |
| l | 667117 | 3.9% |
| t | 638970 | 3.7% |
| s | 549587 | 3.2% |
| Other values (62) | 5924299 |
None
| Value | Count | Frequency (%) |
| é | 98312 | |
| á | 15495 | 10.9% |
| í | 6772 | 4.8% |
| ó | 4018 | 2.8% |
| ô | 3957 | 2.8% |
| ü | 3456 | 2.4% |
| ä | 1785 | 1.3% |
| ã | 1771 | 1.2% |
| è | 1140 | 0.8% |
| ö | 899 | 0.6% |
| Other values (24) | 4162 | 2.9% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 171 | |
| – | 169 |
locality
Text
Missing 
| Distinct | 2397188 |
|---|---|
| Distinct (%) | 56.3% |
| Missing | 763737 |
| Missing (%) | 15.2% |
| Memory size | 38.3 MiB |
Length
| Max length | 736849 |
|---|---|
| Median length | 84356 |
| Mean length | 47.16343342 |
| Min length | 1 |
Unique
| Unique | 1909729 ? |
|---|---|
| Unique (%) | 44.9% |
Sample
| 1st row | Nice. |
|---|---|
| 2nd row | E. Coast Sumatra, Siak, Indrapura |
| 3rd row | Corsica; Cargèse. |
| 4th row | Patras, op rots, bij ruine. |
| 5th row | West Borneo, Sintang G. Pahoe |
| Value | Count | Frequency (%) |
| of | 993695 | 3.3% |
| de | 528801 | 1.8% |
| km | 423817 | 1.4% |
| 366885 | 1.2% | |
| in | 319701 | 1.1% |
| the | 236036 | 0.8% |
| road | 222131 | 0.7% |
| near | 220755 | 0.7% |
| bij | 193361 | 0.6% |
| district | 189242 | 0.6% |
| Other values (1181095) | 26099389 |
Most occurring characters
| Value | Count | Frequency (%) |
| 25605653 | 12.8% | |
| a | 17766400 | 8.9% |
| e | 14813627 | 7.4% |
| n | 11419769 | 5.7% |
| o | 10867963 | 5.4% |
| i | 10448228 | 5.2% |
| r | 10001236 | 5.0% |
| t | 7769321 | 3.9% |
| . | 7553473 | 3.8% |
| s | 6867565 | 3.4% |
| Other values (201) | 77616460 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 134893361 | |
| Space Separator | 25605699 | 12.8% |
| Uppercase Letter | 20272038 | 10.1% |
| Other Punctuation | 13323633 | 6.6% |
| Decimal Number | 2817461 | 1.4% |
| Control | 2060911 | 1.0% |
| Dash Punctuation | 719511 | 0.4% |
| Open Punctuation | 478330 | 0.2% |
| Close Punctuation | 476278 | 0.2% |
| Math Symbol | 66580 | < 0.1% |
| Other values (9) | 15893 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 17766400 | |
| e | 14813627 | |
| n | 11419769 | 8.5% |
| o | 10867963 | 8.1% |
| i | 10448228 | 7.7% |
| r | 10001236 | 7.4% |
| t | 7769321 | 5.8% |
| s | 6867565 | 5.1% |
| l | 6810224 | 5.0% |
| u | 5143259 | 3.8% |
| Other values (52) | 32985769 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1935677 | 9.5% |
| P | 1549270 | 7.6% |
| M | 1469722 | 7.2% |
| B | 1423966 | 7.0% |
| N | 1251858 | 6.2% |
| C | 1191407 | 5.9% |
| A | 1117672 | 5.5% |
| R | 950359 | 4.7% |
| T | 945685 | 4.7% |
| L | 891981 | 4.4% |
| Other values (49) | 7544441 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7553473 | |
| , | 4275769 | |
| : | 615518 | 4.6% |
| ; | 205722 | 1.5% |
| ' | 177844 | 1.3% |
| / | 148168 | 1.1% |
| ! | 115191 | 0.9% |
| * | 109104 | 0.8% |
| " | 65200 | 0.5% |
| ? | 32465 | 0.2% |
| Other values (13) | 25179 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 514984 | |
| 0 | 394884 | |
| 2 | 365497 | |
| 5 | 337037 | |
| 3 | 291722 | |
| 4 | 261157 | |
| 6 | 215405 | |
| 8 | 157352 | 5.6% |
| 7 | 147731 | 5.2% |
| 9 | 131692 | 4.7% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 22174 | |
| ± | 20926 | |
| = | 14977 | |
| > | 3950 | 5.9% |
| < | 2231 | 3.4% |
| + | 2163 | 3.2% |
| ~ | 108 | 0.2% |
| × | 45 | 0.1% |
| ÷ | 4 | < 0.1% |
| ¬ | 2 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 5642 | |
| ¼ | 984 | 13.4% |
| ¾ | 570 | 7.8% |
| ² | 92 | 1.3% |
| ³ | 29 | 0.4% |
| ¹ | 2 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 30 | |
| $ | 19 | |
| ¤ | 3 | 5.4% |
| ¥ | 2 | 3.6% |
| £ | 1 | 1.8% |
| € | 1 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 432219 | |
| [ | 45283 | 9.5% |
| „ | 530 | 0.1% |
| ‚ | 207 | < 0.1% |
| { | 91 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 87 | |
| ` | 86 | |
| ^ | 33 | 15.7% |
| ¨ | 3 | 1.4% |
| ¯ | 1 | 0.5% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 163 | |
| ’ | 82 | |
| › | 28 | 9.5% |
| ” | 22 | 7.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 158 | |
| ‹ | 43 | 19.2% |
| “ | 18 | 8.0% |
| ‘ | 5 | 2.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 719396 | |
| – | 107 | < 0.1% |
| — | 8 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 431154 | |
| ] | 45061 | 9.5% |
| } | 63 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5587 | |
| ® | 19 | 0.3% |
| ¦ | 10 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 25605653 | ||
| 46 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 2050064 | ||
| 10847 | 0.5% |
Other Letter
| Value | Count | Frequency (%) |
| º | 1203 | |
| ª | 25 | 2.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 932 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 155166627 | |
| Common | 45563068 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 17766400 | 11.4% |
| e | 14813627 | 9.5% |
| n | 11419769 | 7.4% |
| o | 10867963 | 7.0% |
| i | 10448228 | 6.7% |
| r | 10001236 | 6.4% |
| t | 7769321 | 5.0% |
| s | 6867565 | 4.4% |
| l | 6810224 | 4.4% |
| u | 5143259 | 3.3% |
| Other values (113) | 53259035 |
Common
| Value | Count | Frequency (%) |
| 25605653 | ||
| . | 7553473 | 16.6% |
| , | 4275769 | 9.4% |
| 2050064 | 4.5% | |
| - | 719396 | 1.6% |
| : | 615518 | 1.4% |
| 1 | 514984 | 1.1% |
| ( | 432219 | 0.9% |
| ) | 431154 | 0.9% |
| 0 | 394884 | 0.9% |
| Other values (78) | 2969954 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200038856 | |
| None | 689579 | 0.3% |
| Punctuation | 1246 | < 0.1% |
| Modifier Letters | 13 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 25605653 | 12.8% | |
| a | 17766400 | 8.9% |
| e | 14813627 | 7.4% |
| n | 11419769 | 5.7% |
| o | 10867963 | 5.4% |
| i | 10448228 | 5.2% |
| r | 10001236 | 5.0% |
| t | 7769321 | 3.9% |
| . | 7553473 | 3.8% |
| s | 6867565 | 3.4% |
| Other values (87) | 76925621 |
None
| Value | Count | Frequency (%) |
| é | 242306 | |
| á | 51416 | 7.5% |
| è | 49482 | 7.2% |
| ü | 38173 | 5.5% |
| ö | 30031 | 4.4% |
| í | 28900 | 4.2% |
| ë | 25491 | 3.7% |
| ó | 23636 | 3.4% |
| ä | 23245 | 3.4% |
| ê | 23036 | 3.3% |
| Other values (88) | 153863 |
Punctuation
| Value | Count | Frequency (%) |
| „ | 530 | |
| ‚ | 207 | 16.6% |
| – | 107 | 8.6% |
| ’ | 82 | 6.6% |
| ‡ | 76 | 6.1% |
| ‰ | 74 | 5.9% |
| ‹ | 43 | 3.5% |
| … | 40 | 3.2% |
| › | 28 | 2.2% |
| ” | 22 | 1.8% |
| Other values (4) | 37 | 3.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 13 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
Missing 
| Distinct | 7745 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 3265629 |
| Missing (%) | 65.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 5 |
| Mean length | 6.313657931 |
| Min length | 5 |
Unique
| Unique | 2081 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 10.0 m |
|---|---|
| 2nd row | 600.0 m |
| 3rd row | 250.0 m |
| 4th row | 20.0 m |
| 5th row | 4.0 m |
| Value | Count | Frequency (%) |
| m | 1754153 | |
| 0.0 | 971270 | |
| 83446 | 2.3% | |
| 100.0 | 30322 | 0.8% |
| 200.0 | 27380 | 0.7% |
| 50.0 | 25985 | 0.7% |
| 300.0 | 21545 | 0.6% |
| 400.0 | 20709 | 0.6% |
| 500.0 | 20313 | 0.6% |
| 1000.0 | 18743 | 0.5% |
| Other values (3230) | 701332 | 19.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3847172 | |
| 1921045 | ||
| . | 1837599 | |
| m | 1754153 | |
| 1 | 378270 | 3.4% |
| 5 | 310048 | 2.8% |
| 2 | 249943 | 2.3% |
| 3 | 159594 | 1.4% |
| 4 | 133323 | 1.2% |
| 6 | 114954 | 1.0% |
| Other values (4) | 369021 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5476760 | |
| Space Separator | 1921045 | 17.3% |
| Other Punctuation | 1837599 | 16.6% |
| Lowercase Letter | 1754153 | 15.8% |
| Dash Punctuation | 85565 | 0.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3847172 | |
| 1 | 378270 | 6.9% |
| 5 | 310048 | 5.7% |
| 2 | 249943 | 4.6% |
| 3 | 159594 | 2.9% |
| 4 | 133323 | 2.4% |
| 6 | 114954 | 2.1% |
| 7 | 110378 | 2.0% |
| 8 | 95036 | 1.7% |
| 9 | 78042 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1921045 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1837599 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1754153 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 85565 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9320969 | |
| Latin | 1754153 | 15.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3847172 | |
| 1921045 | ||
| . | 1837599 | |
| 1 | 378270 | 4.1% |
| 5 | 310048 | 3.3% |
| 2 | 249943 | 2.7% |
| 3 | 159594 | 1.7% |
| 4 | 133323 | 1.4% |
| 6 | 114954 | 1.2% |
| 7 | 110378 | 1.2% |
| Other values (3) | 258643 | 2.8% |
Latin
| Value | Count | Frequency (%) |
| m | 1754153 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11075122 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3847172 | |
| 1921045 | ||
| . | 1837599 | |
| m | 1754153 | |
| 1 | 378270 | 3.4% |
| 5 | 310048 | 2.8% |
| 2 | 249943 | 2.3% |
| 3 | 159594 | 1.4% |
| 4 | 133323 | 1.2% |
| 6 | 114954 | 1.0% |
| Other values (4) | 369021 | 3.3% |
maximumDistanceAboveSurfaceInMeters
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Asia |
|---|
| Value | Count | Frequency (%) |
| asia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1 | |
| s | 1 | |
| i | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3 | |
| Uppercase Letter | 1 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1 | |
| i | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1 | |
| s | 1 | |
| i | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1 | |
| s | 1 | |
| i | 1 | |
| a | 1 |
decimalLatitude
Text
Missing 
| Distinct | 85312 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 2925885 |
| Missing (%) | 58.3% |
| Memory size | 38.3 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.935163477 |
| Min length | 3 |
Unique
| Unique | 32905 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | -2.06667 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | -2.18333 |
| 4th row | -2.18333 |
| 5th row | 1.16667 |
| Value | Count | Frequency (%) |
| 52.16011 | 15831 | 0.8% |
| 7.25 | 9141 | 0.4% |
| 5.83333 | 8412 | 0.4% |
| 3.08333 | 7806 | 0.4% |
| 1.0 | 7629 | 0.4% |
| 6.08333 | 7109 | 0.3% |
| 5.38333 | 6962 | 0.3% |
| 5.33333 | 6857 | 0.3% |
| 52.14714 | 6321 | 0.3% |
| 5.0 | 6138 | 0.3% |
| Other values (80543) | 2011691 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2185740 | |
| . | 2093897 | |
| 6 | 1593621 | |
| 5 | 1522084 | |
| 1 | 1433173 | |
| 7 | 1070771 | |
| 2 | 1042278 | |
| 8 | 898646 | |
| 4 | 733048 | 5.0% |
| 0 | 727483 | 5.0% |
| Other values (3) | 1220777 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11817460 | |
| Other Punctuation | 2093897 | 14.4% |
| Dash Punctuation | 610144 | 4.2% |
| Uppercase Letter | 17 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2185740 | |
| 6 | 1593621 | |
| 5 | 1522084 | |
| 1 | 1433173 | |
| 7 | 1070771 | |
| 2 | 1042278 | |
| 8 | 898646 | |
| 4 | 733048 | 6.2% |
| 0 | 727483 | 6.2% |
| 9 | 610616 | 5.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2093897 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 610144 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14521501 | |
| Latin | 17 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2185740 | |
| . | 2093897 | |
| 6 | 1593621 | |
| 5 | 1522084 | |
| 1 | 1433173 | |
| 7 | 1070771 | |
| 2 | 1042278 | |
| 8 | 898646 | |
| 4 | 733048 | 5.0% |
| 0 | 727483 | 5.0% |
| Other values (2) | 1220760 |
Latin
| Value | Count | Frequency (%) |
| E | 17 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14521518 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2185740 | |
| . | 2093897 | |
| 6 | 1593621 | |
| 5 | 1522084 | |
| 1 | 1433173 | |
| 7 | 1070771 | |
| 2 | 1042278 | |
| 8 | 898646 | |
| 4 | 733048 | 5.0% |
| 0 | 727483 | 5.0% |
| Other values (3) | 1220777 |
decimalLongitude
Text
Missing 
| Distinct | 95316 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 2925885 |
| Missing (%) | 58.3% |
| Memory size | 38.3 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.304637239 |
| Min length | 3 |
Unique
| Unique | 35846 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 100.93333 |
|---|---|
| 2nd row | 112.0 |
| 3rd row | 99.65 |
| 4th row | 99.65 |
| 5th row | 124.58333 |
| Value | Count | Frequency (%) |
| 4.49701 | 15831 | 0.8% |
| 10.41667 | 7696 | 0.4% |
| 4.05 | 7530 | 0.4% |
| 3.01667 | 7109 | 0.3% |
| 4.47406 | 6117 | 0.3% |
| 5.85874 | 5829 | 0.3% |
| 106.7913 | 5061 | 0.2% |
| 4.32798 | 5000 | 0.2% |
| 4.90993 | 4858 | 0.2% |
| 4.47863 | 4793 | 0.2% |
| Other values (91291) | 2024073 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2153780 | |
| . | 2093896 | |
| 1 | 2080828 | |
| 6 | 1818372 | |
| 7 | 1184132 | |
| 5 | 1165208 | |
| 4 | 1075292 | |
| 8 | 917920 | |
| 0 | 850927 | 5.6% |
| 9 | 849460 | 5.6% |
| Other values (10) | 1105343 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12912796 | |
| Other Punctuation | 2093896 | 13.7% |
| Dash Punctuation | 288457 | 1.9% |
| Lowercase Letter | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2153780 | |
| 1 | 2080828 | |
| 6 | 1818372 | |
| 7 | 1184132 | |
| 5 | 1165208 | |
| 4 | 1075292 | |
| 8 | 917920 | |
| 0 | 850927 | 6.6% |
| 9 | 849460 | 6.6% |
| 2 | 816877 | 6.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 1 | |
| i | 1 | |
| l | 1 | |
| n | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| T | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2093896 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 288457 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15295149 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2153780 | |
| . | 2093896 | |
| 1 | 2080828 | |
| 6 | 1818372 | |
| 7 | 1184132 | |
| 5 | 1165208 | |
| 4 | 1075292 | |
| 8 | 917920 | |
| 0 | 850927 | 5.6% |
| 9 | 849460 | 5.6% |
| Other values (2) | 1105334 |
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| E | 1 | |
| T | 1 | |
| h | 1 | |
| i | 1 | |
| l | 1 | |
| n | 1 | |
| d | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15295158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2153780 | |
| . | 2093896 | |
| 1 | 2080828 | |
| 6 | 1818372 | |
| 7 | 1184132 | |
| 5 | 1165208 | |
| 4 | 1075292 | |
| 8 | 917920 | |
| 0 | 850927 | 5.6% |
| 9 | 849460 | 5.6% |
| Other values (10) | 1105343 |
geodeticDatum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | WGS84 |
| 3rd row | WGS84 |
| 4th row | WGS84 |
| 5th row | WGS84 |
| Value | Count | Frequency (%) |
| wgs84 | 5019779 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 5019779 | |
| G | 5019779 | |
| S | 5019779 | |
| 8 | 5019779 | |
| 4 | 5019779 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15059337 | |
| Decimal Number | 10039558 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 5019779 | |
| G | 5019779 | |
| S | 5019779 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 5019779 | |
| 4 | 5019779 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15059337 | |
| Common | 10039558 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 5019779 | |
| G | 5019779 | |
| S | 5019779 |
Common
| Value | Count | Frequency (%) |
| 8 | 5019779 | |
| 4 | 5019779 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25098895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 5019779 | |
| G | 5019779 | |
| S | 5019779 | |
| 8 | 5019779 | |
| 4 | 5019779 |
coordinateUncertaintyInMeters
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | South-Western |
|---|
| Value | Count | Frequency (%) |
| south-western | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2 | |
| e | 2 | |
| S | 1 | |
| o | 1 | |
| u | 1 | |
| h | 1 | |
| - | 1 | |
| W | 1 | |
| s | 1 | |
| r | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 15.4% |
| Dash Punctuation | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2 | |
| e | 2 | |
| o | 1 | |
| u | 1 | |
| h | 1 | |
| s | 1 | |
| r | 1 | |
| n | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| W | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 1 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2 | |
| e | 2 | |
| S | 1 | |
| o | 1 | |
| u | 1 | |
| h | 1 | |
| W | 1 | |
| s | 1 | |
| r | 1 | |
| n | 1 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2 | |
| e | 2 | |
| S | 1 | |
| o | 1 | |
| u | 1 | |
| h | 1 | |
| - | 1 | |
| W | 1 | |
| s | 1 | |
| r | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 70 |
|---|---|
| Median length | 70 |
| Mean length | 70 |
| Min length | 70 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Siam [Thailand], Kwae Noi Basin Expedition, near Neeckey, near Wangka. |
|---|
| Value | Count | Frequency (%) |
| near | 2 | |
| siam | 1 | |
| thailand | 1 | |
| kwae | 1 | |
| noi | 1 | |
| basin | 1 | |
| expedition | 1 | |
| neeckey | 1 | |
| wangka | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | ||
| a | 9 | |
| e | 7 | 10.0% |
| i | 6 | 8.6% |
| n | 6 | 8.6% |
| , | 3 | 4.3% |
| k | 2 | 2.9% |
| d | 2 | 2.9% |
| r | 2 | 2.9% |
| N | 2 | 2.9% |
| Other values (21) | 22 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47 | |
| Space Separator | 9 | 12.9% |
| Uppercase Letter | 8 | 11.4% |
| Other Punctuation | 4 | 5.7% |
| Close Punctuation | 1 | 1.4% |
| Open Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| e | 7 | |
| i | 6 | |
| n | 6 | |
| k | 2 | 4.3% |
| d | 2 | 4.3% |
| r | 2 | 4.3% |
| o | 2 | 4.3% |
| y | 1 | 2.1% |
| c | 1 | 2.1% |
| Other values (9) | 9 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| W | 1 | |
| E | 1 | |
| S | 1 | |
| B | 1 | |
| K | 1 | |
| T | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 9 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55 | |
| Common | 15 | 21.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| e | 7 | |
| i | 6 | 10.9% |
| n | 6 | 10.9% |
| k | 2 | 3.6% |
| d | 2 | 3.6% |
| r | 2 | 3.6% |
| N | 2 | 3.6% |
| o | 2 | 3.6% |
| W | 1 | 1.8% |
| Other values (16) | 16 |
Common
| Value | Count | Frequency (%) |
| 9 | ||
| , | 3 | 20.0% |
| ] | 1 | 6.7% |
| [ | 1 | 6.7% |
| . | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | ||
| a | 9 | |
| e | 7 | 10.0% |
| i | 6 | 8.6% |
| n | 6 | 8.6% |
| , | 3 | 4.3% |
| k | 2 | 2.9% |
| d | 2 | 2.9% |
| r | 2 | 2.9% |
| N | 2 | 2.9% |
| Other values (21) | 22 |
verbatimSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 150.0 m |
|---|
| Value | Count | Frequency (%) |
| 150.0 | 1 | |
| m | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| . | 1 | |
| 1 | ||
| m | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 14.3% |
| Space Separator | 1 | 14.3% |
| Lowercase Letter | 1 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 | |
| Latin | 1 | 14.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| . | 1 | |
| 1 |
Latin
| Value | Count | Frequency (%) |
| m | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| . | 1 | |
| 1 | ||
| m | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 15.1 |
|---|
| Value | Count | Frequency (%) |
| 15.1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 5 | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Other Punctuation | 1 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 5 | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 5 | 1 | |
| . | 1 |
earliestEonOrLowestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 98.46667 |
|---|
| Value | Count | Frequency (%) |
| 98.46667 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 9 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| . | 1 | 12.5% |
| 4 | 1 | 12.5% |
| 7 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 | |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 9 | 1 | 14.3% |
| 8 | 1 | 14.3% |
| 4 | 1 | 14.3% |
| 7 | 1 | 14.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 9 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| . | 1 | 12.5% |
| 4 | 1 | 12.5% |
| 7 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 9 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| . | 1 | 12.5% |
| 4 | 1 | 12.5% |
| 7 | 1 | 12.5% |
latestEonOrHighestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | WGS84 |
|---|
| Value | Count | Frequency (%) |
| wgs84 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 1 | |
| G | 1 | |
| S | 1 | |
| 8 | 1 | |
| 4 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3 | |
| Decimal Number | 2 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 | |
| G | 1 | |
| S | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3 | |
| Common | 2 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 1 | |
| G | 1 | |
| S | 1 |
Common
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 1 | |
| G | 1 | |
| S | 1 | |
| 8 | 1 | |
| 4 | 1 |
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9.5 |
| Mean length | 9.5 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Bakker S |
|---|---|
| 2nd row | Pedersen TM |
| Value | Count | Frequency (%) |
| bakker | 1 | |
| s | 1 | |
| pedersen | 1 | |
| tm | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4 | |
| k | 2 | |
| r | 2 | |
| 2 | ||
| B | 1 | 5.3% |
| a | 1 | 5.3% |
| S | 1 | 5.3% |
| P | 1 | 5.3% |
| d | 1 | 5.3% |
| s | 1 | 5.3% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 5 | |
| Space Separator | 2 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | |
| k | 2 | |
| r | 2 | |
| a | 1 | 8.3% |
| d | 1 | 8.3% |
| s | 1 | 8.3% |
| n | 1 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 | |
| S | 1 | |
| P | 1 | |
| T | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17 | |
| Common | 2 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4 | |
| k | 2 | |
| r | 2 | |
| B | 1 | 5.9% |
| a | 1 | 5.9% |
| S | 1 | 5.9% |
| P | 1 | 5.9% |
| d | 1 | 5.9% |
| s | 1 | 5.9% |
| n | 1 | 5.9% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4 | |
| k | 2 | |
| r | 2 | |
| 2 | ||
| B | 1 | 5.3% |
| a | 1 | 5.3% |
| S | 1 | 5.3% |
| P | 1 | 5.3% |
| d | 1 | 5.3% |
| s | 1 | 5.3% |
| Other values (3) | 3 |
bed
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 32.5 |
| Mean length | 32.5 |
| Min length | 26 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia caesia (Hoffm.) Hampe ex Fürnr. |
|---|---|
| 2nd row | Paullinia elegans Cambess. |
| Value | Count | Frequency (%) |
| physcia | 1 | |
| caesia | 1 | |
| hoffm | 1 | |
| hampe | 1 | |
| ex | 1 | |
| fürnr | 1 | |
| paullinia | 1 | |
| elegans | 1 | |
| cambess | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | 12.3% |
| 7 | 10.8% | |
| e | 6 | 9.2% |
| s | 5 | 7.7% |
| i | 4 | 6.2% |
| m | 3 | 4.6% |
| l | 3 | 4.6% |
| n | 3 | 4.6% |
| . | 3 | 4.6% |
| f | 2 | 3.1% |
| Other values (17) | 21 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47 | |
| Space Separator | 7 | 10.8% |
| Uppercase Letter | 6 | 9.2% |
| Other Punctuation | 3 | 4.6% |
| Close Punctuation | 1 | 1.5% |
| Open Punctuation | 1 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| s | 5 | |
| i | 4 | |
| m | 3 | 6.4% |
| l | 3 | 6.4% |
| n | 3 | 6.4% |
| f | 2 | 4.3% |
| r | 2 | 4.3% |
| c | 2 | 4.3% |
| Other values (9) | 9 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| H | 2 | |
| F | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53 | |
| Common | 12 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | 11.3% |
| s | 5 | 9.4% |
| i | 4 | 7.5% |
| m | 3 | 5.7% |
| l | 3 | 5.7% |
| n | 3 | 5.7% |
| f | 2 | 3.8% |
| r | 2 | 3.8% |
| P | 2 | 3.8% |
| Other values (13) | 15 |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| . | 3 | |
| ) | 1 | 8.3% |
| ( | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64 | |
| None | 1 | 1.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | 12.5% |
| 7 | 10.9% | |
| e | 6 | 9.4% |
| s | 5 | 7.8% |
| i | 4 | 6.2% |
| m | 3 | 4.7% |
| l | 3 | 4.7% |
| n | 3 | 4.7% |
| . | 3 | 4.7% |
| f | 2 | 3.1% |
| Other values (16) | 20 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
typeStatus
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4932431 |
| Missing (%) | 98.3% |
| Memory size | 38.3 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.994413344 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | holotype |
|---|---|
| 2nd row | isotype |
| 3rd row | type |
| 4th row | lectotype |
| 5th row | type |
| Value | Count | Frequency (%) |
| isotype | 39000 | |
| holotype | 14456 | 16.5% |
| type | 14207 | 16.3% |
| syntype | 8771 | 10.0% |
| lectotype | 3004 | 3.4% |
| paratype | 2913 | 3.3% |
| isolectotype | 2782 | 3.2% |
| isosyntype | 1268 | 1.5% |
| neotype | 578 | 0.7% |
| isoneotype | 275 | 0.3% |
| Other values (4) | 97 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 97390 | |
| e | 94065 | |
| t | 93181 | |
| p | 90361 | |
| o | 78963 | |
| s | 53385 | |
| i | 43399 | |
| l | 20264 | 3.3% |
| h | 14456 | 2.4% |
| n | 10892 | 1.8% |
| Other values (3) | 14613 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 610969 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 97390 | |
| e | 94065 | |
| t | 93181 | |
| p | 90361 | |
| o | 78963 | |
| s | 53385 | |
| i | 43399 | |
| l | 20264 | 3.3% |
| h | 14456 | 2.4% |
| n | 10892 | 1.8% |
| Other values (3) | 14613 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 610969 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 97390 | |
| e | 94065 | |
| t | 93181 | |
| p | 90361 | |
| o | 78963 | |
| s | 53385 | |
| i | 43399 | |
| l | 20264 | 3.3% |
| h | 14456 | 2.4% |
| n | 10892 | 1.8% |
| Other values (3) | 14613 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 610969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 97390 | |
| e | 94065 | |
| t | 93181 | |
| p | 90361 | |
| o | 78963 | |
| s | 53385 | |
| i | 43399 | |
| l | 20264 | 3.3% |
| h | 14456 | 2.4% |
| n | 10892 | 1.8% |
| Other values (3) | 14613 | 2.4% |
identifiedBy
Text
Missing 
| Distinct | 12783 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 4152104 |
| Missing (%) | 82.7% |
| Memory size | 38.3 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 54 |
| Mean length | 11.4119028 |
| Min length | 1 |
Unique
| Unique | 4702 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Wood GHS |
|---|---|
| 2nd row | Steenis CGGJ van |
| 3rd row | Pereira JT; Wong KM |
| 4th row | Ashton PS |
| 5th row | Nooteboom HP |
| Value | Count | Frequency (%) |
| van | 89759 | 4.4% |
| de | 47967 | 2.4% |
| der | 26776 | 1.3% |
| p | 26721 | 1.3% |
| a | 25734 | 1.3% |
| maas | 25086 | 1.2% |
| j | 24227 | 1.2% |
| jongkind | 21969 | 1.1% |
| cch | 21965 | 1.1% |
| d | 21201 | 1.0% |
| Other values (9388) | 1699526 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1163257 | 11.7% | |
| e | 898443 | 9.1% |
| n | 642275 | 6.5% |
| a | 603392 | 6.1% |
| r | 456478 | 4.6% |
| o | 414361 | 4.2% |
| J | 355793 | 3.6% |
| i | 334328 | 3.4% |
| s | 326378 | 3.3% |
| l | 308100 | 3.1% |
| Other values (99) | 4399052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5818627 | |
| Uppercase Letter | 2825370 | |
| Space Separator | 1163257 | 11.7% |
| Other Punctuation | 59304 | 0.6% |
| Dash Punctuation | 34039 | 0.3% |
| Close Punctuation | 602 | < 0.1% |
| Open Punctuation | 602 | < 0.1% |
| Decimal Number | 53 | < 0.1% |
| Connector Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 898443 | |
| n | 642275 | |
| a | 603392 | |
| r | 456478 | 7.8% |
| o | 414361 | 7.1% |
| i | 334328 | 5.7% |
| s | 326378 | 5.6% |
| l | 308100 | 5.3% |
| d | 275276 | 4.7% |
| t | 207110 | 3.6% |
| Other values (41) | 1352486 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 355793 | |
| C | 236242 | 8.4% |
| M | 234695 | 8.3% |
| H | 210215 | 7.4% |
| A | 199191 | 7.1% |
| P | 176050 | 6.2% |
| S | 167959 | 5.9% |
| B | 159051 | 5.6% |
| W | 151893 | 5.4% |
| L | 108781 | 3.9% |
| Other values (27) | 825500 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 54937 | |
| . | 2962 | 5.0% |
| ' | 1054 | 1.8% |
| ! | 283 | 0.5% |
| ? | 56 | 0.1% |
| : | 6 | < 0.1% |
| & | 5 | < 0.1% |
| * | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15 | |
| 9 | 10 | |
| 6 | 9 | |
| 4 | 8 | |
| 3 | 4 | 7.5% |
| 2 | 4 | 7.5% |
| 5 | 2 | 3.8% |
| 0 | 1 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 1163257 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34039 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 602 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 602 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8643997 | |
| Common | 1257860 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 898443 | 10.4% |
| n | 642275 | 7.4% |
| a | 603392 | 7.0% |
| r | 456478 | 5.3% |
| o | 414361 | 4.8% |
| J | 355793 | 4.1% |
| i | 334328 | 3.9% |
| s | 326378 | 3.8% |
| l | 308100 | 3.6% |
| d | 275276 | 3.2% |
| Other values (78) | 4029173 |
Common
| Value | Count | Frequency (%) |
| 1163257 | ||
| ; | 54937 | 4.4% |
| - | 34039 | 2.7% |
| . | 2962 | 0.2% |
| ' | 1054 | 0.1% |
| ) | 602 | < 0.1% |
| ( | 602 | < 0.1% |
| ! | 283 | < 0.1% |
| ? | 56 | < 0.1% |
| 1 | 15 | < 0.1% |
| Other values (11) | 53 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9881797 | |
| None | 20060 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1163257 | 11.8% | |
| e | 898443 | 9.1% |
| n | 642275 | 6.5% |
| a | 603392 | 6.1% |
| r | 456478 | 4.6% |
| o | 414361 | 4.2% |
| J | 355793 | 3.6% |
| i | 334328 | 3.4% |
| s | 326378 | 3.3% |
| l | 308100 | 3.1% |
| Other values (63) | 4378992 |
None
| Value | Count | Frequency (%) |
| é | 5736 | |
| á | 4134 | |
| í | 2324 | |
| ö | 2142 | 10.7% |
| ü | 1281 | 6.4% |
| è | 672 | 3.3% |
| ñ | 660 | 3.3% |
| ä | 525 | 2.6% |
| ó | 423 | 2.1% |
| ú | 334 | 1.7% |
| Other values (26) | 1829 | 9.1% |
dateIdentified
Text
Missing 
| Distinct | 16460 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 4581006 |
| Missing (%) | 91.3% |
| Memory size | 38.3 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 10 |
| Mean length | 10.00016409 |
| Min length | 10 |
Unique
| Unique | 4674 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 1956/11/22 |
|---|---|
| 2nd row | 1995/09/27 |
| 3rd row | 1968/07/01 |
| 4th row | 1972/06/01 |
| 5th row | 1957/01/18 |
| Value | Count | Frequency (%) |
| 1955/03/01 | 2137 | 0.5% |
| 1972/06/01 | 2001 | 0.5% |
| 1968/07/01 | 1800 | 0.4% |
| 2001/12/01 | 1724 | 0.4% |
| 1995/10/01 | 1545 | 0.4% |
| 1979/08/01 | 1473 | 0.3% |
| 1989/08/01 | 1409 | 0.3% |
| 2000/06/01 | 1393 | 0.3% |
| 2000/01/01 | 1358 | 0.3% |
| 2000/12/01 | 1344 | 0.3% |
| Other values (16450) | 422592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1100102 | |
| 1 | 886397 | |
| / | 877548 | |
| 2 | 396674 | 9.0% |
| 9 | 393348 | 9.0% |
| 8 | 143873 | 3.3% |
| 7 | 131859 | 3.0% |
| 5 | 122219 | 2.8% |
| 6 | 121562 | 2.8% |
| 3 | 110818 | 2.5% |
| Other values (25) | 103432 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3510192 | |
| Other Punctuation | 877548 | 20.0% |
| Lowercase Letter | 76 | < 0.1% |
| Uppercase Letter | 9 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| i | 8 | |
| c | 8 | |
| n | 7 | |
| s | 6 | |
| l | 4 | 5.3% |
| h | 3 | 3.9% |
| o | 2 | 2.6% |
| y | 2 | 2.6% |
| Other values (7) | 10 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1100102 | |
| 1 | 886397 | |
| 2 | 396674 | 11.3% |
| 9 | 393348 | 11.2% |
| 8 | 143873 | 4.1% |
| 7 | 131859 | 3.8% |
| 5 | 122219 | 3.5% |
| 6 | 121562 | 3.5% |
| 3 | 110818 | 3.2% |
| 4 | 103340 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3 | |
| P | 2 | |
| S | 2 | |
| C | 1 | 11.1% |
| F | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 877548 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4387747 | |
| Latin | 85 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| i | 8 | |
| c | 8 | |
| n | 7 | 8.2% |
| s | 6 | 7.1% |
| l | 4 | 4.7% |
| L | 3 | 3.5% |
| h | 3 | 3.5% |
| o | 2 | 2.4% |
| Other values (12) | 18 |
Common
| Value | Count | Frequency (%) |
| 0 | 1100102 | |
| 1 | 886397 | |
| / | 877548 | |
| 2 | 396674 | 9.0% |
| 9 | 393348 | 9.0% |
| 8 | 143873 | 3.3% |
| 7 | 131859 | 3.0% |
| 5 | 122219 | 2.8% |
| 6 | 121562 | 2.8% |
| 3 | 110818 | 2.5% |
| Other values (3) | 103347 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4387832 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1100102 | |
| 1 | 886397 | |
| / | 877548 | |
| 2 | 396674 | 9.0% |
| 9 | 393348 | 9.0% |
| 8 | 143873 | 3.3% |
| 7 | 131859 | 3.0% |
| 5 | 122219 | 2.8% |
| 6 | 121562 | 2.8% |
| 3 | 110818 | 2.5% |
| Other values (25) | 103432 | 2.4% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Fungi |
|---|---|
| 2nd row | Plantae |
| Value | Count | Frequency (%) |
| fungi | 1 | |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| F | 1 | |
| u | 1 | |
| g | 1 | |
| i | 1 | |
| P | 1 | |
| l | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| u | 1 | |
| g | 1 | |
| i | 1 | |
| l | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| F | 1 | |
| u | 1 | |
| g | 1 | |
| i | 1 | |
| P | 1 | |
| l | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| F | 1 | |
| u | 1 | |
| g | 1 | |
| i | 1 | |
| P | 1 | |
| l | 1 | |
| t | 1 | |
| e | 1 |
identificationVerificationStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Fungi-Ascomycota |
|---|
| Value | Count | Frequency (%) |
| fungi-ascomycota | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| F | 1 | 6.2% |
| u | 1 | 6.2% |
| n | 1 | 6.2% |
| g | 1 | 6.2% |
| i | 1 | 6.2% |
| - | 1 | 6.2% |
| A | 1 | 6.2% |
| s | 1 | 6.2% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13 | |
| Uppercase Letter | 2 | 12.5% |
| Dash Punctuation | 1 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| u | 1 | |
| n | 1 | |
| g | 1 | |
| i | 1 | |
| s | 1 | |
| m | 1 | |
| y | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 | |
| A | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15 | |
| Common | 1 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| F | 1 | 6.7% |
| u | 1 | 6.7% |
| n | 1 | 6.7% |
| g | 1 | 6.7% |
| i | 1 | 6.7% |
| A | 1 | 6.7% |
| s | 1 | 6.7% |
| m | 1 | 6.7% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2 | |
| o | 2 | |
| F | 1 | 6.2% |
| u | 1 | 6.2% |
| n | 1 | 6.2% |
| g | 1 | 6.2% |
| i | 1 | 6.2% |
| - | 1 | 6.2% |
| A | 1 | 6.2% |
| s | 1 | 6.2% |
| Other values (4) | 4 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Lichenes-Lecanoromycetes |
|---|
| Value | Count | Frequency (%) |
| lichenes-lecanoromycetes | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | |
| c | 3 | |
| L | 2 | 8.3% |
| n | 2 | 8.3% |
| s | 2 | 8.3% |
| o | 2 | 8.3% |
| i | 1 | 4.2% |
| h | 1 | 4.2% |
| - | 1 | 4.2% |
| a | 1 | 4.2% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21 | |
| Uppercase Letter | 2 | 8.3% |
| Dash Punctuation | 1 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| c | 3 | |
| n | 2 | 9.5% |
| s | 2 | 9.5% |
| o | 2 | 9.5% |
| i | 1 | 4.8% |
| h | 1 | 4.8% |
| a | 1 | 4.8% |
| r | 1 | 4.8% |
| m | 1 | 4.8% |
| Other values (2) | 2 | 9.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 | |
| Common | 1 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| c | 3 | |
| L | 2 | 8.7% |
| n | 2 | 8.7% |
| s | 2 | 8.7% |
| o | 2 | 8.7% |
| i | 1 | 4.3% |
| h | 1 | 4.3% |
| a | 1 | 4.3% |
| r | 1 | 4.3% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | |
| c | 3 | |
| L | 2 | 8.3% |
| n | 2 | 8.3% |
| s | 2 | 8.3% |
| o | 2 | 8.3% |
| i | 1 | 4.2% |
| h | 1 | 4.2% |
| - | 1 | 4.2% |
| a | 1 | 4.2% |
| Other values (4) | 4 |
taxonID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Caliciales |
|---|---|
| 2nd row | Sapindales |
| Value | Count | Frequency (%) |
| caliciales | 1 | |
| sapindales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 3 | |
| i | 3 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 5.0% |
| c | 1 | 5.0% |
| S | 1 | 5.0% |
| p | 1 | 5.0% |
| n | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 2 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 3 | |
| i | 3 | |
| e | 2 | |
| s | 2 | |
| c | 1 | 5.6% |
| p | 1 | 5.6% |
| n | 1 | 5.6% |
| d | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 3 | |
| i | 3 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 5.0% |
| c | 1 | 5.0% |
| S | 1 | 5.0% |
| p | 1 | 5.0% |
| n | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 3 | |
| i | 3 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 5.0% |
| c | 1 | 5.0% |
| S | 1 | 5.0% |
| p | 1 | 5.0% |
| n | 1 | 5.0% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 15.5 |
| Mean length | 15.5 |
| Min length | 11 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Lichenes-Physciaceae |
|---|---|
| 2nd row | Sapindaceae |
| Value | Count | Frequency (%) |
| lichenes-physciaceae | 1 | |
| sapindaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6 | |
| a | 5 | |
| c | 4 | |
| i | 3 | |
| h | 2 | 6.5% |
| n | 2 | 6.5% |
| s | 2 | 6.5% |
| L | 1 | 3.2% |
| - | 1 | 3.2% |
| P | 1 | 3.2% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27 | |
| Uppercase Letter | 3 | 9.7% |
| Dash Punctuation | 1 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6 | |
| a | 5 | |
| c | 4 | |
| i | 3 | |
| h | 2 | 7.4% |
| n | 2 | 7.4% |
| s | 2 | 7.4% |
| y | 1 | 3.7% |
| p | 1 | 3.7% |
| d | 1 | 3.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| P | 1 | |
| S | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 | |
| Common | 1 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6 | |
| a | 5 | |
| c | 4 | |
| i | 3 | |
| h | 2 | 6.7% |
| n | 2 | 6.7% |
| s | 2 | 6.7% |
| L | 1 | 3.3% |
| P | 1 | 3.3% |
| y | 1 | 3.3% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6 | |
| a | 5 | |
| c | 4 | |
| i | 3 | |
| h | 2 | 6.5% |
| n | 2 | 6.5% |
| s | 2 | 6.5% |
| L | 1 | 3.2% |
| - | 1 | 3.2% |
| P | 1 | 3.2% |
| Other values (4) | 4 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Physcia |
|---|---|
| 2nd row | Paullinia |
| Value | Count | Frequency (%) |
| physcia | 1 | |
| paullinia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 3 | |
| P | 2 | |
| l | 2 | |
| h | 1 | 6.2% |
| y | 1 | 6.2% |
| s | 1 | 6.2% |
| c | 1 | 6.2% |
| u | 1 | 6.2% |
| n | 1 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 2 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 3 | |
| l | 2 | |
| h | 1 | 7.1% |
| y | 1 | 7.1% |
| s | 1 | 7.1% |
| c | 1 | 7.1% |
| u | 1 | 7.1% |
| n | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 3 | |
| P | 2 | |
| l | 2 | |
| h | 1 | 6.2% |
| y | 1 | 6.2% |
| s | 1 | 6.2% |
| c | 1 | 6.2% |
| u | 1 | 6.2% |
| n | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 3 | |
| P | 2 | |
| l | 2 | |
| h | 1 | 6.2% |
| y | 1 | 6.2% |
| s | 1 | 6.2% |
| c | 1 | 6.2% |
| u | 1 | 6.2% |
| n | 1 | 6.2% |
scientificName
Text
| Distinct | 376061 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 224 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 105 |
|---|---|
| Median length | 90 |
| Mean length | 28.47094744 |
| Min length | 2 |
Unique
| Unique | 133003 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | Plantago psyllium L. |
|---|---|
| 2nd row | Shorea platycarpa Heim |
| 3rd row | Plantago psyllium L. |
| 4th row | Agathis borneensis Warb. |
| 5th row | Plantago psyllium L. |
| Value | Count | Frequency (%) |
| l | 1223482 | 6.8% |
| 361839 | 2.0% | |
| ex | 258858 | 1.4% |
| var | 234755 | 1.3% |
| blume | 178015 | 1.0% |
| subsp | 159935 | 0.9% |
| dc | 110044 | 0.6% |
| benth | 87621 | 0.5% |
| indet | 79377 | 0.4% |
| miq | 74956 | 0.4% |
| Other values (123549) | 15222043 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13264086 | 9.3% |
| 12971975 | 9.1% | |
| i | 10267217 | 7.2% |
| e | 8900034 | 6.2% |
| r | 8009635 | 5.6% |
| l | 7115999 | 5.0% |
| s | 6980933 | 4.9% |
| o | 6762853 | 4.7% |
| n | 6461888 | 4.5% |
| . | 6448256 | 4.5% |
| Other values (122) | 55728696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 106479342 | |
| Uppercase Letter | 13376376 | 9.4% |
| Space Separator | 12971978 | 9.1% |
| Other Punctuation | 6905531 | 4.8% |
| Open Punctuation | 1546480 | 1.1% |
| Close Punctuation | 1546471 | 1.1% |
| Dash Punctuation | 67960 | < 0.1% |
| Math Symbol | 9008 | < 0.1% |
| Decimal Number | 8413 | < 0.1% |
| Connector Punctuation | 7 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13264086 | |
| i | 10267217 | 9.6% |
| e | 8900034 | 8.4% |
| r | 8009635 | 7.5% |
| l | 7115999 | 6.7% |
| s | 6980933 | 6.6% |
| o | 6762853 | 6.4% |
| n | 6461888 | 6.1% |
| u | 6433194 | 6.0% |
| t | 5317010 | 5.0% |
| Other values (49) | 26966493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1812034 | |
| C | 1196963 | 8.9% |
| S | 1136590 | 8.5% |
| B | 1026418 | 7.7% |
| P | 856932 | 6.4% |
| M | 848111 | 6.3% |
| A | 844635 | 6.3% |
| H | 742314 | 5.5% |
| D | 685066 | 5.1% |
| R | 626066 | 4.7% |
| Other values (26) | 3601247 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6448256 | |
| & | 361669 | 5.2% |
| ' | 79999 | 1.2% |
| , | 15080 | 0.2% |
| " | 323 | < 0.1% |
| ? | 144 | < 0.1% |
| ! | 37 | < 0.1% |
| / | 15 | < 0.1% |
| • | 2 | < 0.1% |
| ; | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5692 | |
| 2 | 1705 | 20.3% |
| 3 | 195 | 2.3% |
| 0 | 183 | 2.2% |
| 6 | 136 | 1.6% |
| 7 | 131 | 1.6% |
| 4 | 125 | 1.5% |
| 5 | 94 | 1.1% |
| 8 | 77 | 0.9% |
| 9 | 75 | 0.9% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 8996 | |
| = | 6 | 0.1% |
| + | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12971975 | ||
| 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1539498 | |
| [ | 6982 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1539489 | |
| ] | 6982 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 67960 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119855718 | |
| Common | 23055854 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13264086 | 11.1% |
| i | 10267217 | 8.6% |
| e | 8900034 | 7.4% |
| r | 8009635 | 6.7% |
| l | 7115999 | 5.9% |
| s | 6980933 | 5.8% |
| o | 6762853 | 5.6% |
| n | 6461888 | 5.4% |
| u | 6433194 | 5.4% |
| t | 5317010 | 4.4% |
| Other values (85) | 40342869 |
Common
| Value | Count | Frequency (%) |
| 12971975 | ||
| . | 6448256 | |
| ( | 1539498 | 6.7% |
| ) | 1539489 | 6.7% |
| & | 361669 | 1.6% |
| ' | 79999 | 0.3% |
| - | 67960 | 0.3% |
| , | 15080 | 0.1% |
| × | 8996 | < 0.1% |
| ] | 6982 | < 0.1% |
| Other values (27) | 15950 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 142730005 | |
| None | 181561 | 0.1% |
| Punctuation | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 13264086 | 9.3% |
| 12971975 | 9.1% | |
| i | 10267217 | 7.2% |
| e | 8900034 | 6.2% |
| r | 8009635 | 5.6% |
| l | 7115999 | 5.0% |
| s | 6980933 | 4.9% |
| o | 6762853 | 4.7% |
| n | 6461888 | 4.5% |
| . | 6448256 | 4.5% |
| Other values (75) | 55547129 |
None
| Value | Count | Frequency (%) |
| ü | 83317 | |
| é | 43766 | |
| ö | 14267 | 7.9% |
| × | 8996 | 5.0% |
| ä | 7026 | 3.9% |
| ó | 4777 | 2.6% |
| á | 4688 | 2.6% |
| è | 3961 | 2.2% |
| ø | 2863 | 1.6% |
| ç | 864 | 0.5% |
| Other values (35) | 7036 | 3.9% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 4 | |
| • | 2 |
parentNameUsage
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6.5 |
| Mean length | 6.5 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | caesia |
|---|---|
| 2nd row | elegans |
| Value | Count | Frequency (%) |
| caesia | 1 | |
| elegans | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| s | 2 | |
| c | 1 | 7.7% |
| i | 1 | 7.7% |
| l | 1 | 7.7% |
| g | 1 | 7.7% |
| n | 1 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| s | 2 | |
| c | 1 | 7.7% |
| i | 1 | 7.7% |
| l | 1 | 7.7% |
| g | 1 | 7.7% |
| n | 1 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| s | 2 | |
| c | 1 | 7.7% |
| i | 1 | 7.7% |
| l | 1 | 7.7% |
| g | 1 | 7.7% |
| n | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| e | 3 | |
| s | 2 | |
| c | 1 | 7.7% |
| i | 1 | 7.7% |
| l | 1 | 7.7% |
| g | 1 | 7.7% |
| n | 1 | 7.7% |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 5019780 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | species |
|---|---|
| 2nd row | species |
| Value | Count | Frequency (%) |
| species | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 4 | |
| e | 4 | |
| p | 2 | |
| c | 2 | |
| i | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 4 | |
| e | 4 | |
| p | 2 | |
| c | 2 | |
| i | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 4 | |
| e | 4 | |
| p | 2 | |
| c | 2 | |
| i | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 4 | |
| e | 4 | |
| p | 2 | |
| c | 2 | |
| i | 2 |
| Distinct | 1414 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 488 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 68 |
| Mean length | 29.82403601 |
| Min length | 8 |
Unique
| Unique | 116 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae|Lamiales|Plantaginaceae |
|---|---|
| 2nd row | Plantae|Malvales|Dipterocarpaceae |
| 3rd row | Plantae|Lamiales|Plantaginaceae |
| 4th row | Plantae|Cupressales|Araucariaceae |
| 5th row | Plantae|Lamiales|Plantaginaceae |
| Value | Count | Frequency (%) |
| plantae|fabales|fabaceae | 308028 | 6.1% |
| plantae|asterales|asteraceae | 302803 | 6.0% |
| plantae|poales|poaceae | 281272 | 5.6% |
| plantae|gentianales|rubiaceae | 189877 | 3.8% |
| plantae|poales|cyperaceae | 141951 | 2.8% |
| plantae|lamiales|lamiaceae | 116077 | 2.3% |
| plantae|rosales|rosaceae | 114928 | 2.3% |
| plantae|asparagales|orchidaceae | 94113 | 1.9% |
| plantae|malpighiales|euphorbiaceae | 91199 | 1.8% |
| plantae|malvales|malvaceae | 80345 | 1.6% |
| Other values (1415) | 3309436 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 30545157 | |
| e | 22993015 | |
| l | 13087306 | |
| | | 10172681 | 6.8% |
| n | 8204441 | 5.5% |
| t | 7522789 | 5.0% |
| s | 7327084 | 4.9% |
| c | 7056672 | 4.7% |
| P | 6334479 | 4.2% |
| i | 5649209 | 3.8% |
| Other values (50) | 30802772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 123945796 | |
| Uppercase Letter | 15347409 | 10.3% |
| Math Symbol | 10172681 | 6.8% |
| Dash Punctuation | 183013 | 0.1% |
| Other Punctuation | 35969 | < 0.1% |
| Space Separator | 10735 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30545157 | |
| e | 22993015 | |
| l | 13087306 | |
| n | 8204441 | 6.6% |
| t | 7522789 | 6.1% |
| s | 7327084 | 5.9% |
| c | 7056672 | 5.7% |
| i | 5649209 | 4.6% |
| r | 4219211 | 3.4% |
| o | 4137100 | 3.3% |
| Other values (17) | 13203812 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 6334479 | |
| A | 1541788 | 10.0% |
| M | 1150202 | 7.5% |
| C | 1027425 | 6.7% |
| F | 966551 | 6.3% |
| L | 791072 | 5.2% |
| R | 773189 | 5.0% |
| S | 593219 | 3.9% |
| G | 453807 | 3.0% |
| E | 432771 | 2.8% |
| Other values (16) | 1282906 | 8.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 21596 | |
| . | 14373 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 10172681 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 183013 |
Space Separator
| Value | Count | Frequency (%) |
| 10735 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 139293205 | |
| Common | 10402400 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30545157 | |
| e | 22993015 | |
| l | 13087306 | |
| n | 8204441 | 5.9% |
| t | 7522789 | 5.4% |
| s | 7327084 | 5.3% |
| c | 7056672 | 5.1% |
| P | 6334479 | 4.5% |
| i | 5649209 | 4.1% |
| r | 4219211 | 3.0% |
| Other values (43) | 26353842 |
Common
| Value | Count | Frequency (%) |
| | | 10172681 | |
| - | 183013 | 1.8% |
| ? | 21596 | 0.2% |
| . | 14373 | 0.1% |
| 10735 | 0.1% | |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149695604 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 30545157 | |
| e | 22993015 | |
| l | 13087306 | |
| | | 10172681 | 6.8% |
| n | 8204441 | 5.5% |
| t | 7522789 | 5.0% |
| s | 7327084 | 4.9% |
| c | 7056672 | 4.7% |
| P | 6334479 | 4.2% |
| i | 5649209 | 3.8% |
| Other values (49) | 30802771 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
kingdom
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 496 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.982224563 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 4861934 | |
| fungi | 104468 | 2.1% |
| chromista | 37916 | 0.8% |
| eubacteria | 14458 | 0.3% |
| protozoa | 510 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9791210 | |
| n | 4966402 | |
| t | 4914818 | |
| e | 4876392 | |
| P | 4862444 | |
| l | 4861934 | |
| i | 156842 | 0.4% |
| u | 118926 | 0.3% |
| F | 104468 | 0.3% |
| g | 104468 | 0.3% |
| Other values (10) | 287878 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30026496 | |
| Uppercase Letter | 5019286 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9791210 | |
| n | 4966402 | |
| t | 4914818 | |
| e | 4876392 | |
| l | 4861934 | |
| i | 156842 | 0.5% |
| u | 118926 | 0.4% |
| g | 104468 | 0.3% |
| r | 52884 | 0.2% |
| o | 39446 | 0.1% |
| Other values (6) | 143174 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 4862444 | |
| F | 104468 | 2.1% |
| C | 37916 | 0.8% |
| E | 14458 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 35045782 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9791210 | |
| n | 4966402 | |
| t | 4914818 | |
| e | 4876392 | |
| P | 4862444 | |
| l | 4861934 | |
| i | 156842 | 0.4% |
| u | 118926 | 0.3% |
| F | 104468 | 0.3% |
| g | 104468 | 0.3% |
| Other values (10) | 287878 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35045782 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9791210 | |
| n | 4966402 | |
| t | 4914818 | |
| e | 4876392 | |
| P | 4862444 | |
| l | 4861934 | |
| i | 156842 | 0.4% |
| u | 118926 | 0.3% |
| F | 104468 | 0.3% |
| g | 104468 | 0.3% |
| Other values (10) | 287878 | 0.8% |
phylum
Text
Missing 
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4742156 |
| Missing (%) | 94.5% |
| Memory size | 38.3 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 22 |
| Mean length | 13.09837695 |
| Min length | 3 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Fungi-Ascomycota |
|---|---|
| 2nd row | Fungi-Ascomycota |
| 3rd row | Fungi-Ascomycota |
| 4th row | Fungi-Ascomycota |
| 5th row | Fungi-Ascomycota |
| Value | Count | Frequency (%) |
| rhodophyta | 69790 | |
| fungi-basidiomycota | 52456 | |
| fungi-ascomycota | 45602 | |
| chlorophyta | 45198 | |
| ochrophyta | 32323 | |
| cyanobacteria | 14344 | 5.2% |
| charophyta | 11679 | 4.2% |
| bacillariophyta | 5278 | 1.9% |
| amoebozoa | 445 | 0.2% |
| oomycota | 248 | 0.1% |
| Other values (19) | 263 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 492110 | |
| a | 381066 | 10.5% |
| h | 323307 | 8.9% |
| t | 277145 | 7.6% |
| y | 277061 | 7.6% |
| i | 228102 | 6.3% |
| c | 196019 | 5.4% |
| p | 164308 | 4.5% |
| d | 122285 | 3.4% |
| n | 112587 | 3.1% |
| Other values (28) | 1062460 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3162532 | |
| Uppercase Letter | 375774 | 10.3% |
| Dash Punctuation | 98144 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 492110 | |
| a | 381066 | |
| h | 323307 | |
| t | 277145 | |
| y | 277061 | |
| i | 228102 | 7.2% |
| c | 196019 | 6.2% |
| p | 164308 | 5.2% |
| d | 122285 | 3.9% |
| n | 112587 | 3.6% |
| Other values (11) | 588542 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 98144 | |
| C | 71252 | |
| R | 69790 | |
| B | 57734 | |
| A | 46053 | |
| O | 32571 | 8.7% |
| E | 92 | < 0.1% |
| M | 72 | < 0.1% |
| P | 34 | < 0.1% |
| Z | 15 | < 0.1% |
| Other values (6) | 17 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 98144 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3538306 | |
| Common | 98144 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 492110 | |
| a | 381066 | 10.8% |
| h | 323307 | 9.1% |
| t | 277145 | 7.8% |
| y | 277061 | 7.8% |
| i | 228102 | 6.4% |
| c | 196019 | 5.5% |
| p | 164308 | 4.6% |
| d | 122285 | 3.5% |
| n | 112587 | 3.2% |
| Other values (27) | 964316 |
Common
| Value | Count | Frequency (%) |
| - | 98144 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3636450 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 492110 | |
| a | 381066 | 10.5% |
| h | 323307 | 8.9% |
| t | 277145 | 7.6% |
| y | 277061 | 7.6% |
| i | 228102 | 6.3% |
| c | 196019 | 5.4% |
| p | 164308 | 4.5% |
| d | 122285 | 3.4% |
| n | 112587 | 3.1% |
| Other values (28) | 1062460 |
class
Text
Missing 
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4741605 |
| Missing (%) | 94.5% |
| Memory size | 38.3 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 26 |
| Mean length | 15.96505103 |
| Min length | 6 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lichenes- |
|---|---|
| 2nd row | Lichenes- |
| 3rd row | Lichenes- |
| 4th row | Lichenes- |
| 5th row | Lichenes- |
| Value | Count | Frequency (%) |
| florideophyceae | 65290 | |
| fungi-agaricomycetes | 48215 | |
| phaeophyceae | 30367 | |
| ulvophyceae | 29708 | |
| lichenes-lecanoromycetes | 26216 | |
| chlorophyceae | 13735 | 4.9% |
| cyanophyceae | 13043 | 4.7% |
| charophyceae | 7130 | 2.6% |
| fungi-pezizomycetes | 6007 | 2.2% |
| lichenes | 5353 | 1.9% |
| Other values (82) | 33337 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 745781 | |
| o | 397801 | 9.0% |
| c | 389649 | 8.8% |
| a | 323750 | 7.3% |
| y | 285919 | 6.4% |
| h | 262782 | 5.9% |
| i | 255647 | 5.8% |
| r | 180931 | 4.1% |
| p | 175103 | 3.9% |
| n | 153502 | 3.5% |
| Other values (33) | 1270245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3962494 | |
| Uppercase Letter | 375284 | 8.5% |
| Dash Punctuation | 103108 | 2.3% |
| Space Separator | 224 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 745781 | |
| o | 397801 | |
| c | 389649 | |
| a | 323750 | |
| y | 285919 | 7.2% |
| h | 262782 | 6.6% |
| i | 255647 | 6.5% |
| r | 180931 | 4.6% |
| p | 175103 | 4.4% |
| n | 153502 | 3.9% |
| Other values (13) | 791629 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 135947 | |
| L | 62393 | |
| A | 49691 | 13.2% |
| C | 39125 | 10.4% |
| P | 38446 | 10.2% |
| U | 29773 | 7.9% |
| B | 7002 | 1.9% |
| S | 5253 | 1.4% |
| D | 2966 | 0.8% |
| T | 1654 | 0.4% |
| Other values (8) | 3034 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103108 |
Space Separator
| Value | Count | Frequency (%) |
| 224 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4337778 | |
| Common | 103332 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 745781 | |
| o | 397801 | 9.2% |
| c | 389649 | 9.0% |
| a | 323750 | 7.5% |
| y | 285919 | 6.6% |
| h | 262782 | 6.1% |
| i | 255647 | 5.9% |
| r | 180931 | 4.2% |
| p | 175103 | 4.0% |
| n | 153502 | 3.5% |
| Other values (31) | 1166913 |
Common
| Value | Count | Frequency (%) |
| - | 103108 | |
| 224 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4441110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 745781 | |
| o | 397801 | 9.0% |
| c | 389649 | 8.8% |
| a | 323750 | 7.3% |
| y | 285919 | 6.4% |
| h | 262782 | 5.9% |
| i | 255647 | 5.8% |
| r | 180931 | 4.1% |
| p | 175103 | 3.9% |
| n | 153502 | 3.5% |
| Other values (33) | 1270245 |
order
Text
Missing 
| Distinct | 380 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 143842 |
| Missing (%) | 2.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 18 |
| Mean length | 9.414907279 |
| Min length | 1 |
Unique
| Unique | 34 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lamiales |
|---|---|
| 2nd row | Malvales |
| 3rd row | Lamiales |
| 4th row | Cupressales |
| 5th row | Lamiales |
| Value | Count | Frequency (%) |
| poales | 469510 | 9.6% |
| malpighiales | 338062 | 6.9% |
| asterales | 336633 | 6.9% |
| fabales | 327880 | 6.7% |
| lamiales | 320256 | 6.6% |
| gentianales | 310878 | 6.4% |
| rosales | 239690 | 4.9% |
| ericales | 188588 | 3.9% |
| caryophyllales | 183655 | 3.8% |
| sapindales | 166440 | 3.4% |
| Other values (371) | 1994349 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7993736 | |
| l | 6487296 | |
| s | 6014249 | |
| e | 5900801 | |
| i | 2811732 | 6.1% |
| o | 1694688 | 3.7% |
| r | 1659025 | 3.6% |
| n | 1397350 | 3.0% |
| p | 1231222 | 2.7% |
| t | 1086092 | 2.4% |
| Other values (39) | 9630332 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41030581 | |
| Uppercase Letter | 4875908 | 10.6% |
| Other Punctuation | 33 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7993736 | |
| l | 6487296 | |
| s | 6014249 | |
| e | 5900801 | |
| i | 2811732 | 6.9% |
| o | 1694688 | 4.1% |
| r | 1659025 | 4.0% |
| n | 1397350 | 3.4% |
| p | 1231222 | 3.0% |
| t | 1086092 | 2.6% |
| Other values (15) | 4754390 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 718332 | |
| P | 707510 | |
| A | 686301 | |
| L | 426986 | |
| F | 382064 | |
| C | 367874 | |
| G | 358040 | |
| R | 326597 | |
| S | 324515 | |
| E | 209320 | 4.3% |
| Other values (12) | 368369 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 33 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45906489 | |
| Common | 34 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7993736 | |
| l | 6487296 | |
| s | 6014249 | |
| e | 5900801 | |
| i | 2811732 | 6.1% |
| o | 1694688 | 3.7% |
| r | 1659025 | 3.6% |
| n | 1397350 | 3.0% |
| p | 1231222 | 2.7% |
| t | 1086092 | 2.4% |
| Other values (37) | 9630298 |
Common
| Value | Count | Frequency (%) |
| ? | 33 | |
| 1 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45906523 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7993736 | |
| l | 6487296 | |
| s | 6014249 | |
| e | 5900801 | |
| i | 2811732 | 6.1% |
| o | 1694688 | 3.7% |
| r | 1659025 | 3.6% |
| n | 1397350 | 3.0% |
| p | 1231222 | 2.7% |
| t | 1086092 | 2.4% |
| Other values (39) | 9630332 |
family
Text
| Distinct | 1406 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1212 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 25 |
| Mean length | 10.7858368 |
| Min length | 1 |
Unique
| Unique | 112 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantaginaceae |
|---|---|
| 2nd row | Dipterocarpaceae |
| 3rd row | Plantaginaceae |
| 4th row | Araucariaceae |
| 5th row | Plantaginaceae |
| Value | Count | Frequency (%) |
| fabaceae | 308028 | 6.1% |
| asteraceae | 302803 | 6.0% |
| poaceae | 281272 | 5.6% |
| rubiaceae | 189877 | 3.8% |
| cyperaceae | 141951 | 2.8% |
| lamiaceae | 116077 | 2.3% |
| rosaceae | 114928 | 2.3% |
| orchidaceae | 94113 | 1.9% |
| euphorbiaceae | 91199 | 1.8% |
| malvaceae | 80345 | 1.6% |
| Other values (1398) | 3308484 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12436459 | |
| e | 11470038 | |
| c | 6036012 | |
| i | 2424988 | 4.5% |
| r | 2326369 | 4.3% |
| o | 2005164 | 3.7% |
| n | 1687186 | 3.1% |
| l | 1617349 | 3.0% |
| t | 1411236 | 2.6% |
| s | 1142952 | 2.1% |
| Other values (46) | 11571724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48926205 | |
| Uppercase Letter | 5076927 | 9.4% |
| Dash Punctuation | 79905 | 0.1% |
| Other Punctuation | 35933 | 0.1% |
| Space Separator | 10507 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12436459 | |
| e | 11470038 | |
| c | 6036012 | |
| i | 2424988 | 5.0% |
| r | 2326369 | 4.8% |
| o | 2005164 | 4.1% |
| n | 1687186 | 3.4% |
| l | 1617349 | 3.3% |
| t | 1411236 | 2.9% |
| s | 1142952 | 2.3% |
| Other values (16) | 6368452 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 805796 | |
| P | 726079 | |
| C | 582509 | |
| R | 446592 | |
| M | 431232 | |
| F | 344071 | |
| L | 301693 | 5.9% |
| S | 263451 | 5.2% |
| B | 214499 | 4.2% |
| E | 208569 | 4.1% |
| Other values (16) | 752436 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 21563 | |
| . | 14370 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 79905 |
Space Separator
| Value | Count | Frequency (%) |
| 10507 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 54003132 | |
| Common | 126345 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12436459 | |
| e | 11470038 | |
| c | 6036012 | |
| i | 2424988 | 4.5% |
| r | 2326369 | 4.3% |
| o | 2005164 | 3.7% |
| n | 1687186 | 3.1% |
| l | 1617349 | 3.0% |
| t | 1411236 | 2.6% |
| s | 1142952 | 2.1% |
| Other values (42) | 11445379 |
Common
| Value | Count | Frequency (%) |
| - | 79905 | |
| ? | 21563 | 17.1% |
| . | 14370 | 11.4% |
| 10507 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54129477 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12436459 | |
| e | 11470038 | |
| c | 6036012 | |
| i | 2424988 | 4.5% |
| r | 2326369 | 4.3% |
| o | 2005164 | 3.7% |
| n | 1687186 | 3.1% |
| l | 1617349 | 3.0% |
| t | 1411236 | 2.6% |
| s | 1142952 | 2.1% |
| Other values (46) | 11571724 |
genus
Text
| Distinct | 20571 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 224 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 8.494074976 |
| Min length | 2 |
Unique
| Unique | 3021 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Plantago |
|---|---|
| 2nd row | Shorea |
| 3rd row | Plantago |
| 4th row | Agathis |
| 5th row | Plantago |
| Value | Count | Frequency (%) |
| indet | 79377 | 1.6% |
| carex | 59786 | 1.2% |
| ficus | 43081 | 0.9% |
| rubus | 36824 | 0.7% |
| taraxacum | 28101 | 0.6% |
| hieracium | 27463 | 0.5% |
| cyperus | 23409 | 0.5% |
| salix | 21702 | 0.4% |
| ranunculus | 21385 | 0.4% |
| euphorbia | 19128 | 0.4% |
| Other values (20562) | 4659308 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5220873 | 12.2% |
| i | 3825339 | 9.0% |
| e | 2967585 | 7.0% |
| r | 2820232 | 6.6% |
| o | 2777638 | 6.5% |
| u | 2380535 | 5.6% |
| s | 2337197 | 5.5% |
| n | 2234005 | 5.2% |
| l | 2185796 | 5.1% |
| t | 1806151 | 4.2% |
| Other values (47) | 14081151 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37536848 | |
| Uppercase Letter | 5019562 | 11.8% |
| Other Punctuation | 79377 | 0.2% |
| Dash Punctuation | 517 | < 0.1% |
| Math Symbol | 192 | < 0.1% |
| Space Separator | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5220873 | |
| i | 3825339 | |
| e | 2967585 | 7.9% |
| r | 2820232 | 7.5% |
| o | 2777638 | 7.4% |
| u | 2380535 | 6.3% |
| s | 2337197 | 6.2% |
| n | 2234005 | 6.0% |
| l | 2185796 | 5.8% |
| t | 1806151 | 4.8% |
| Other values (17) | 8981497 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 691853 | |
| P | 514468 | 10.2% |
| A | 478156 | 9.5% |
| S | 477199 | 9.5% |
| M | 288158 | 5.7% |
| D | 266339 | 5.3% |
| L | 256569 | 5.1% |
| E | 243750 | 4.9% |
| T | 236943 | 4.7% |
| H | 215486 | 4.3% |
| Other values (16) | 1350641 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 79377 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 517 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 192 |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42556410 | |
| Common | 80092 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5220873 | 12.3% |
| i | 3825339 | 9.0% |
| e | 2967585 | 7.0% |
| r | 2820232 | 6.6% |
| o | 2777638 | 6.5% |
| u | 2380535 | 5.6% |
| s | 2337197 | 5.5% |
| n | 2234005 | 5.2% |
| l | 2185796 | 5.1% |
| t | 1806151 | 4.2% |
| Other values (43) | 14001059 |
Common
| Value | Count | Frequency (%) |
| . | 79377 | |
| - | 517 | 0.6% |
| × | 192 | 0.2% |
| 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42636307 | |
| None | 195 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5220873 | 12.2% |
| i | 3825339 | 9.0% |
| e | 2967585 | 7.0% |
| r | 2820232 | 6.6% |
| o | 2777638 | 6.5% |
| u | 2380535 | 5.6% |
| s | 2337197 | 5.5% |
| n | 2234005 | 5.2% |
| l | 2185796 | 5.1% |
| t | 1806151 | 4.2% |
| Other values (45) | 14080956 |
None
| Value | Count | Frequency (%) |
| × | 192 | |
| ë | 3 | 1.5% |
subgenus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 42 |
| Mean length | 42 |
| Min length | 42 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Fimbristylis bisumbellata (Forssk.) Bubani |
|---|
| Value | Count | Frequency (%) |
| fimbristylis | 1 | |
| bisumbellata | 1 | |
| forssk | 1 | |
| bubani | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 5 | |
| b | 4 | 9.5% |
| l | 3 | 7.1% |
| a | 3 | 7.1% |
| 3 | 7.1% | |
| F | 2 | 4.8% |
| u | 2 | 4.8% |
| t | 2 | 4.8% |
| r | 2 | 4.8% |
| Other values (10) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Space Separator | 3 | 7.1% |
| Uppercase Letter | 3 | 7.1% |
| Open Punctuation | 1 | 2.4% |
| Other Punctuation | 1 | 2.4% |
| Close Punctuation | 1 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 5 | |
| b | 4 | |
| l | 3 | |
| a | 3 | |
| u | 2 | 6.1% |
| t | 2 | 6.1% |
| r | 2 | 6.1% |
| m | 2 | 6.1% |
| y | 1 | 3.0% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36 | |
| Common | 6 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 5 | |
| b | 4 | |
| l | 3 | |
| a | 3 | |
| F | 2 | 5.6% |
| u | 2 | 5.6% |
| t | 2 | 5.6% |
| r | 2 | 5.6% |
| m | 2 | 5.6% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 3 | ||
| ( | 1 | 16.7% |
| . | 1 | 16.7% |
| ) | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 5 | |
| b | 4 | 9.5% |
| l | 3 | 7.1% |
| a | 3 | 7.1% |
| 3 | 7.1% | |
| F | 2 | 4.8% |
| u | 2 | 4.8% |
| t | 2 | 4.8% |
| r | 2 | 4.8% |
| Other values (10) | 11 |
specificEpithet
Text
Missing 
| Distinct | 74468 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 420613 |
| Missing (%) | 8.4% |
| Memory size | 38.3 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 23 |
| Mean length | 9.008492186 |
| Min length | 2 |
Unique
| Unique | 19361 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | psyllium |
|---|---|
| 2nd row | platycarpa |
| 3rd row | psyllium |
| 4th row | borneensis |
| 5th row | psyllium |
| Value | Count | Frequency (%) |
| vulgaris | 23406 | 0.5% |
| palustris | 17443 | 0.4% |
| arvensis | 16770 | 0.4% |
| indica | 15214 | 0.3% |
| officinalis | 15193 | 0.3% |
| repens | 13144 | 0.3% |
| maritima | 12040 | 0.3% |
| alpina | 11689 | 0.3% |
| tomentosa | 11026 | 0.2% |
| montana | 10538 | 0.2% |
| Other values (74399) | 4454027 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5674666 | |
| i | 4678236 | |
| s | 3099567 | 7.5% |
| e | 2899018 | 7.0% |
| r | 2736809 | 6.6% |
| l | 2697840 | 6.5% |
| n | 2575102 | 6.2% |
| u | 2534175 | 6.1% |
| o | 2373564 | 5.7% |
| t | 2185739 | 5.3% |
| Other values (70) | 9976862 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41389612 | |
| Dash Punctuation | 30883 | 0.1% |
| Math Symbol | 8467 | < 0.1% |
| Space Separator | 1347 | < 0.1% |
| Other Punctuation | 850 | < 0.1% |
| Uppercase Letter | 146 | < 0.1% |
| Decimal Number | 105 | < 0.1% |
| Open Punctuation | 83 | < 0.1% |
| Close Punctuation | 81 | < 0.1% |
| Initial Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5674666 | |
| i | 4678236 | |
| s | 3099567 | 7.5% |
| e | 2899018 | 7.0% |
| r | 2736809 | 6.6% |
| l | 2697840 | 6.5% |
| n | 2575102 | 6.2% |
| u | 2534175 | 6.1% |
| o | 2373564 | 5.7% |
| t | 2185739 | 5.3% |
| Other values (26) | 9934896 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 55 | |
| M | 43 | |
| F | 8 | 5.5% |
| I | 6 | 4.1% |
| A | 6 | 4.1% |
| E | 6 | 4.1% |
| D | 5 | 3.4% |
| S | 4 | 2.7% |
| N | 3 | 2.1% |
| W | 2 | 1.4% |
| Other values (6) | 8 | 5.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 765 | |
| ? | 36 | 4.2% |
| " | 32 | 3.8% |
| ! | 11 | 1.3% |
| • | 2 | 0.2% |
| * | 1 | 0.1% |
| & | 1 | 0.1% |
| / | 1 | 0.1% |
| % | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 44 | |
| 2 | 30 | |
| 3 | 15 | 14.3% |
| 4 | 4 | 3.8% |
| 7 | 4 | 3.8% |
| 8 | 3 | 2.9% |
| 9 | 3 | 2.9% |
| 5 | 1 | 1.0% |
| 0 | 1 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 8464 | |
| = | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1344 | ||
| 3 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 69 | |
| [ | 14 | 16.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 67 | |
| ] | 14 | 17.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30883 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41389758 | |
| Common | 41820 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5674666 | |
| i | 4678236 | |
| s | 3099567 | 7.5% |
| e | 2899018 | 7.0% |
| r | 2736809 | 6.6% |
| l | 2697840 | 6.5% |
| n | 2575102 | 6.2% |
| u | 2534175 | 6.1% |
| o | 2373564 | 5.7% |
| t | 2185739 | 5.3% |
| Other values (42) | 9935042 |
Common
| Value | Count | Frequency (%) |
| - | 30883 | |
| × | 8464 | 20.2% |
| 1344 | 3.2% | |
| . | 765 | 1.8% |
| ( | 69 | 0.2% |
| ) | 67 | 0.2% |
| 1 | 44 | 0.1% |
| ? | 36 | 0.1% |
| " | 32 | 0.1% |
| 2 | 30 | 0.1% |
| Other values (18) | 86 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41423019 | |
| None | 8553 | < 0.1% |
| Punctuation | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5674666 | |
| i | 4678236 | |
| s | 3099567 | 7.5% |
| e | 2899018 | 7.0% |
| r | 2736809 | 6.6% |
| l | 2697840 | 6.5% |
| n | 2575102 | 6.2% |
| u | 2534175 | 6.1% |
| o | 2373564 | 5.7% |
| t | 2185739 | 5.3% |
| Other values (56) | 9968303 |
None
| Value | Count | Frequency (%) |
| × | 8464 | |
| ü | 38 | 0.4% |
| ë | 21 | 0.2% |
| ï | 9 | 0.1% |
| ö | 6 | 0.1% |
| é | 5 | 0.1% |
| á | 3 | < 0.1% |
| 3 | < 0.1% | |
| ó | 1 | < 0.1% |
| ä | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 4 | |
| • | 2 |
Missing 
| Distinct | 25248 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 4607995 |
| Missing (%) | 91.8% |
| Memory size | 38.3 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 9.160235753 |
| Min length | 1 |
Unique
| Unique | 9012 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | velutinata |
|---|---|
| 2nd row | mollis |
| 3rd row | bract brevioribus |
| 4th row | vrieseanum |
| 5th row | candollei |
| Value | Count | Frequency (%) |
| angustifolia | 2329 | 0.6% |
| glabra | 2075 | 0.5% |
| pubescens | 1991 | 0.5% |
| vulgaris | 1822 | 0.4% |
| minor | 1585 | 0.4% |
| major | 1573 | 0.4% |
| album | 1571 | 0.4% |
| montana | 1497 | 0.4% |
| alba | 1374 | 0.3% |
| typica | 1327 | 0.3% |
| Other values (25081) | 396330 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 508708 | |
| i | 418276 | |
| s | 281997 | 7.5% |
| e | 267917 | 7.1% |
| l | 258041 | 6.8% |
| r | 245993 | 6.5% |
| u | 238357 | 6.3% |
| n | 229721 | 6.1% |
| o | 218434 | 5.8% |
| t | 200564 | 5.3% |
| Other values (56) | 904058 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3767394 | |
| Dash Punctuation | 2480 | 0.1% |
| Space Separator | 1689 | < 0.1% |
| Other Punctuation | 353 | < 0.1% |
| Uppercase Letter | 96 | < 0.1% |
| Open Punctuation | 22 | < 0.1% |
| Close Punctuation | 22 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 508708 | |
| i | 418276 | |
| s | 281997 | 7.5% |
| e | 267917 | 7.1% |
| l | 258041 | 6.8% |
| r | 245993 | 6.5% |
| u | 238357 | 6.3% |
| n | 229721 | 6.1% |
| o | 218434 | 5.8% |
| t | 200564 | 5.3% |
| Other values (28) | 899386 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 28 | |
| A | 11 | 11.5% |
| H | 10 | 10.4% |
| M | 9 | 9.4% |
| L | 7 | 7.3% |
| P | 7 | 7.3% |
| C | 5 | 5.2% |
| V | 5 | 5.2% |
| O | 4 | 4.2% |
| G | 2 | 2.1% |
| Other values (5) | 8 | 8.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 192 | |
| . | 128 | |
| ! | 19 | 5.4% |
| & | 10 | 2.8% |
| ? | 4 | 1.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | |
| [ | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 | |
| ] | 10 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2480 |
Space Separator
| Value | Count | Frequency (%) |
| 1689 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 8 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3767490 | |
| Common | 4576 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 508708 | |
| i | 418276 | |
| s | 281997 | 7.5% |
| e | 267917 | 7.1% |
| l | 258041 | 6.8% |
| r | 245993 | 6.5% |
| u | 238357 | 6.3% |
| n | 229721 | 6.1% |
| o | 218434 | 5.8% |
| t | 200564 | 5.3% |
| Other values (43) | 899482 |
Common
| Value | Count | Frequency (%) |
| - | 2480 | |
| 1689 | ||
| ' | 192 | 4.2% |
| . | 128 | 2.8% |
| ! | 19 | 0.4% |
| ( | 12 | 0.3% |
| ) | 12 | 0.3% |
| [ | 10 | 0.2% |
| ] | 10 | 0.2% |
| & | 10 | 0.2% |
| Other values (3) | 14 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3771989 | |
| None | 77 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 508708 | |
| i | 418276 | |
| s | 281997 | 7.5% |
| e | 267917 | 7.1% |
| l | 258041 | 6.8% |
| r | 245993 | 6.5% |
| u | 238357 | 6.3% |
| n | 229721 | 6.1% |
| o | 218434 | 5.8% |
| t | 200564 | 5.3% |
| Other values (43) | 903981 |
None
| Value | Count | Frequency (%) |
| ë | 26 | |
| é | 11 | |
| × | 8 | 10.4% |
| ü | 7 | 9.1% |
| ê | 5 | 6.5% |
| ö | 5 | 6.5% |
| û | 4 | 5.2% |
| á | 3 | 3.9% |
| ó | 2 | 2.6% |
| ï | 2 | 2.6% |
| Other values (3) | 4 | 5.2% |
taxonRank
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 224 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.632937601 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | species |
|---|---|
| 2nd row | species |
| 3rd row | species |
| 4th row | species |
| 5th row | species |
| Value | Count | Frequency (%) |
| species | 4187406 | |
| genus | 420365 | 8.4% |
| var | 230817 | 4.6% |
| subsp | 148885 | 3.0% |
| f | 32085 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 9092947 | |
| e | 8795177 | |
| p | 4336291 | |
| c | 4187406 | |
| i | 4187406 | |
| u | 569250 | 1.7% |
| g | 420365 | 1.3% |
| n | 420365 | 1.3% |
| . | 411787 | 1.2% |
| v | 230817 | 0.7% |
| Other values (4) | 642604 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32882628 | |
| Other Punctuation | 411787 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 9092947 | |
| e | 8795177 | |
| p | 4336291 | |
| c | 4187406 | |
| i | 4187406 | |
| u | 569250 | 1.7% |
| g | 420365 | 1.3% |
| n | 420365 | 1.3% |
| v | 230817 | 0.7% |
| a | 230817 | 0.7% |
| Other values (3) | 411787 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 411787 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32882628 | |
| Common | 411787 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 9092947 | |
| e | 8795177 | |
| p | 4336291 | |
| c | 4187406 | |
| i | 4187406 | |
| u | 569250 | 1.7% |
| g | 420365 | 1.3% |
| n | 420365 | 1.3% |
| v | 230817 | 0.7% |
| a | 230817 | 0.7% |
| Other values (3) | 411787 | 1.3% |
Common
| Value | Count | Frequency (%) |
| . | 411787 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33294415 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 9092947 | |
| e | 8795177 | |
| p | 4336291 | |
| c | 4187406 | |
| i | 4187406 | |
| u | 569250 | 1.7% |
| g | 420365 | 1.3% |
| n | 420365 | 1.3% |
| . | 411787 | 1.2% |
| v | 230817 | 0.7% |
| Other values (4) | 642604 | 1.9% |
Missing 
| Distinct | 65242 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 355313 |
| Missing (%) | 7.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 77 |
|---|---|
| Median length | 70 |
| Mean length | 9.034118996 |
| Min length | 1 |
Unique
| Unique | 15043 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | L. |
|---|---|
| 2nd row | Heim |
| 3rd row | L. |
| 4th row | Warb. |
| 5th row | L. |
| Value | Count | Frequency (%) |
| l | 1269845 | 17.0% |
| 360987 | 4.8% | |
| ex | 256772 | 3.4% |
| blume | 179897 | 2.4% |
| dc | 108295 | 1.4% |
| benth | 85823 | 1.1% |
| miq | 72408 | 1.0% |
| r.br | 65429 | 0.9% |
| willd | 61790 | 0.8% |
| merr | 59114 | 0.8% |
| Other values (13133) | 4949569 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 5991115 | 14.2% |
| 2806041 | 6.7% | |
| e | 2684191 | 6.4% |
| r | 1931749 | 4.6% |
| l | 1914225 | 4.5% |
| L | 1600647 | 3.8% |
| a | 1554933 | 3.7% |
| ) | 1480364 | 3.5% |
| ( | 1480364 | 3.5% |
| n | 1371375 | 3.3% |
| Other values (100) | 19324364 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21643314 | |
| Uppercase Letter | 8321416 | 19.7% |
| Other Punctuation | 6375580 | 15.1% |
| Space Separator | 2806041 | 6.7% |
| Close Punctuation | 1480387 | 3.5% |
| Open Punctuation | 1480387 | 3.5% |
| Dash Punctuation | 32237 | 0.1% |
| Decimal Number | 4 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2684191 | |
| r | 1931749 | 8.9% |
| l | 1914225 | 8.8% |
| a | 1554933 | 7.2% |
| n | 1371375 | 6.3% |
| o | 1354940 | 6.3% |
| i | 1277795 | 5.9% |
| u | 1109030 | 5.1% |
| t | 1098013 | 5.1% |
| h | 1075013 | 5.0% |
| Other values (45) | 6272050 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1600647 | |
| B | 817292 | 9.8% |
| S | 648729 | 7.8% |
| M | 543932 | 6.5% |
| H | 513761 | 6.2% |
| C | 493195 | 5.9% |
| R | 431939 | 5.2% |
| D | 408905 | 4.9% |
| A | 358247 | 4.3% |
| P | 336475 | 4.0% |
| Other values (26) | 2168294 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5991115 | |
| & | 360876 | 5.7% |
| , | 14711 | 0.2% |
| ' | 8762 | 0.1% |
| ? | 104 | < 0.1% |
| ! | 7 | < 0.1% |
| ; | 2 | < 0.1% |
| / | 2 | < 0.1% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 1 | |
| 7 | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1480364 | |
| ] | 23 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1480364 | |
| [ | 23 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2806041 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32237 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29964730 | |
| Common | 12174638 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2684191 | 9.0% |
| r | 1931749 | 6.4% |
| l | 1914225 | 6.4% |
| L | 1600647 | 5.3% |
| a | 1554933 | 5.2% |
| n | 1371375 | 4.6% |
| o | 1354940 | 4.5% |
| i | 1277795 | 4.3% |
| u | 1109030 | 3.7% |
| t | 1098013 | 3.7% |
| Other values (81) | 14067832 |
Common
| Value | Count | Frequency (%) |
| . | 5991115 | |
| 2806041 | ||
| ) | 1480364 | 12.2% |
| ( | 1480364 | 12.2% |
| & | 360876 | 3.0% |
| - | 32237 | 0.3% |
| , | 14711 | 0.1% |
| ' | 8762 | 0.1% |
| ? | 104 | < 0.1% |
| ] | 23 | < 0.1% |
| Other values (9) | 41 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41970851 | |
| None | 168517 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 5991115 | 14.3% |
| 2806041 | 6.7% | |
| e | 2684191 | 6.4% |
| r | 1931749 | 4.6% |
| l | 1914225 | 4.6% |
| L | 1600647 | 3.8% |
| a | 1554933 | 3.7% |
| ) | 1480364 | 3.5% |
| ( | 1480364 | 3.5% |
| n | 1371375 | 3.3% |
| Other values (61) | 19155847 |
None
| Value | Count | Frequency (%) |
| ü | 81436 | |
| é | 43052 | |
| ö | 13870 | 8.2% |
| ä | 6673 | 4.0% |
| á | 4502 | 2.7% |
| ó | 4378 | 2.6% |
| è | 3798 | 2.3% |
| ø | 2862 | 1.7% |
| ê | 1148 | 0.7% |
| ç | 862 | 0.5% |
| Other values (29) | 5936 | 3.5% |
vernacularName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 38.3 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ICN |
|---|---|
| 2nd row | ICN |
| 3rd row | ICN |
| 4th row | ICN |
| 5th row | ICN |
| Value | Count | Frequency (%) |
| icn | 5019779 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 5019779 | |
| C | 5019779 | |
| N | 5019779 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15059337 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 5019779 | |
| C | 5019779 | |
| N | 5019779 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15059337 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 5019779 | |
| C | 5019779 | |
| N | 5019779 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15059337 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 5019779 | |
| C | 5019779 | |
| N | 5019779 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 5019781 |
| Missing (%) | > 99.9% |
| Memory size | 38.3 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Poales |
|---|
| Value | Count | Frequency (%) |
| poales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1 | |
| o | 1 | |
| a | 1 | |
| l | 1 | |
| e | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 1 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| a | 1 | |
| l | 1 | |
| e | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| o | 1 | |
| a | 1 | |
| l | 1 | |
| e | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1 | |
| o | 1 | |
| a | 1 | |
| l | 1 | |
| e | 1 | |
| s | 1 |